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Chapter 1 


INTRODUCTION 


The ELIM-COMPLIP System, developed by the General Research Corporation 
(GRC) and the predecessor organization. Research Analysis Corporation (RAC), 
is one of the key tools used for Army manpower planning. The major com¬ 
ponents of the system are ELIM (^nlisted IjOss Inventory Model), which is 
used to monitor, analyze, and make the official forecast of enlisted 
losses, and COMPLIP (Computation of Manpower Programs Using Linear Pro¬ 
gramming) , which is used to generate the official Army manpower program, 
as well as programs used to evaluate policy alternatives. A recent addi¬ 
tion to the system is a Gains Module, with the capability to forecast 
immediate reenlistments and the available quantities of various user- 
defined categories of supply-limited no-prior-service (NPS.) gains. The 
capability to forecast immediate reenlistments was incorporated into 
existing modules of the system. However, a separate module was developed 
to forecast the supply-limited NPS gains. This is known as the NPS Gains 
Module (NPSGM) and is the subject of this document. 

The ELIM-COMPLIP System has been documented in three volumes.^ This 

documentation is currently in the process of being revised into four volumes 

2 

to include the recent additions. 


^Holz, B. W., et al, "The ELIM-COMPLIP System of Manpower Planning 
Models, Three Volumes, General Research Corporation, OAD-CR-18, December 
1973. 

^Holz, B. W., et al. Two New Versions of the ELIM-COMPLIP System, 
Four Volumes, General Research Corporation, OAD-CR- , in process of 
being published. 
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SUMMARY OF THE ELIM-COMPLIP SYSTEM 


General 

The ELIM-COMPLIP System is used by the Manpower Programs Division 
(MPD) of the Office of the Deputy Chief of Staff for Personnel (ODCSPER) 
to produce the official Army manpower program, as well as to generate 
programs for use by the Department of the Army (DA) and Department of 
Defense (DOD) in the examination of Army manpower policy alternatives. 

The manpower program is a forecast of various categories of Active 
Army strength, gains, and losses and the Reserve Enlisted Program (REP) 
of entry on active duty for training (ADT) . A manpower program—^^^hich 
covers each month of the current fiscal year (FY) , sometimes the imme*- 
diately preceding FY, and from two to six future FYs—^reflects the 
current status of Amy manpower, recent past experience, and plans and 
assumptions concerning the future. 

Inputs to the system include a variety of historical data, a large 

part of which is input from other automated systems, and a number of user 

specifications. Included in the latter are the following: (a) objectives 

(targets) for the Army’s operating strength; (b) any applicable limitations 

on total end strength and/or man years; (c) projections of officer gains 
* 

and losses and prior-service (PS) enlisted gains; (d) specifications 
concerning policies governing such matters as enlistments, reenlistments, 
extensions of terms of service, and various types of early release for 
enlisted personnel; (e) training objectives for the REP; and (f) the 
programmed capacity of the training base. 

A number of types of output reports and graphical displays of both 
input and output data are available to assist the user in analyzing the 
effect of postulated policies and other assumptions. Outputs from the 
system have been used for such purposes as the following: (a) decisions 
about draft calls during the period FY70 to FY73; (b) evaluation of the 

* 

There is an option that permits COMPLIP to compute these projections 
in the light of user specifications concerning the officer force. 

ick 

During most of this period only the COMPLIP part of the system was 
in existence. 
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effect of the discontinuation of the draft and determination of the 
requirements for volunteer enlistments; (c) preparation of the budget 
for military personnel (MPA); (d) preparation of Program Objectives 
Memorandum (POM), submitted annually to the Office of the Secretary of 
Defense (OSD); (e) consideration of proposed discharge programs, such 
as that used currently to screen recruits during the first six months 
of service; and (f) planning for the training of recruits. 

ELIM 

The function of ELIM is to produce forecasts of enlisted losses. 

ELIM accomplishes this by applying loss rates to the strengths of corres¬ 
ponding elements of the enlisted population. The loss rates are derived 
from historical data, subject to user modification, when desired, to reflect 
assumptions concerning the effect of postulated changes in policy or prac¬ 
tice from that reflected in the historical data. For certain types of 
loss—specifically losses associated with any special early release 
policies—^ELIM relies entirely on user-specified factors. 

The population used by ELIM is a profile of the enlisted inventory 
derived primarily from the Enlisted Master File (EMF). The objective is 
to use as a base for loss projections information concerning the enlisted 
population that is the most recent available and that describes the popula¬ 
tion in terms of characteristics that can be expected to have an important 
influence on the frequency with which losses of various kinds occur. 

The model makes separate projections for a number of different causes 
of loss associated with a number of different population categories. Losses 
are grouped into categories that reflect major manpower policies and/or 
consist of components that are relatively homogeneous in the way they vary 
with time and with the population variables used in the projection process. 

The three versions of ELIM—^ELIM-I, ELIM-II, and ELIM-III—differ with 

respect to the breakouts used for the first term (FT) Regular Army (RA) 

population. In all three versions the RA population is broken out by FT 

3 

and career, where the distinction is that of the DCSPER—46 report, i.e., 
careerists are those who have more than 36 months of service, with all 
others designated FT. In ELIM-I the FT popvilation is broken out by term 
of service and months to expiration of term of service (ETS). 

■^Dept. of Army, "Strength of the Army," DCSPER-46, published monthly. 
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In ELIM-II a distinction is made between "first timers" (FTI)— 
those who are serving on their first enlistment contract—and "second 
timers" (STI)—those who have either reenlisted or extended the term of 
service, but according to the DCSPER~46 definition are still designated 
FT* The FTI are broken out by term of service and months to ETS, while 
the STI are broken out only by months to ETS. The breakout of FTI vs 
STI enhances projection accuracy and provides additional historical 
strength and loss data that are useful for other applications. 

In ELIM-III there is a further breakout of FTI in the first 21 months 
of service into a maximum of four groups, designated characteristic groups 
or C-groups, where the user specifies the definition of these classes in 
terms of characteristics such as age, race, sex, civilian education, and 
scores on classification tests. For example, C-group 1 might consist of 
high school graduates; C-group 2 of those classified in mental group 1, 2 
or 3 who have not graduated from high school; C-group 3 of mental group 4 
non-graduates who are aged 18-20; and C-group 4 of all others. It is 
anticipated that the user will vary these definitions to correspond to 
specific policies that are in effect or under consideration—e.g., con¬ 
straints on the input of high school graduates or those classified in mental 
group 4. Another consideration bearing on the user's specification of C-group 
definitions is the influence of certain characteristics—e.g., civilian educa¬ 
tion, race, and age—on loss projection errors. 

COMPLIP 

The function of COMPLIP is to generate an optimal manpower program— 
i.e., a program that both satisfies all of the user-specifications, if it 
is feasible to do so, and is optimal in some sense, where the user can 

"k 

exercise some choice with respect to the criterion for optimality and a 
wide range of choice concerning constraints on the manpower program. 

Typically the model operates as follows: given user specifications con¬ 
cerning such matters as operating strength targets, constraints on various 
kinds of strength—e.g., total average strength (man years) and/or total 
end strength—^and plans for the REP and the training base, COMPLIP determines 
the monthly levels of untrained (i.e., NFS) accessions for the Active Army 

k 

Usual practice is to specify two or more criteria to be used in 
sequence. 
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that bring projected operating strength into the closest possible agree¬ 
ment with the monthly targets. Once this has been accomplished, annual 
REP entry on ADT is maximized in such a way that monthly inputs to basic 
training centers are smoothed as much as possible. 

The new version of COMPLIP, C0MPLIP-G2, provides a number of new 
options with respect to model formulation. The most important of these 
is the capability to deal explicitly with the breakout of FT enlistees 
by the C-group discussed previously in connection with ELIM-III. Thus, 
constraints can be imposed, as appropriate, on the projected availability 
and allowable input of recruits corresponding to each C-group. Further¬ 
more, loss rates applicable to each C-group are applied over the first 21 
months of service. An automated Matrix Generator for C0MPLIP-G2 facilitates 
the tailoring of the model for each application. 

System Linkages 

Automated linkages exist between the various modules of the system. 

To facilitate use, each type of user—supplied data must be input to the 
system only once. When the same data element is required by more than 
one module or program it is passed automatically from one to the other. 
Further, when a new run is made, the only inputs that must generally be 
supplied are those that differ from data used in the preceding run. 

THE NFS GAINS MODULE 

The categories of enlisted gains listed in the manpower program are 
given in Table 1, with FY75 monthly averages for each category. On the 
average, there were approximately 25,000 gains per month. About 63 percent 
were NFS gains and 23 percent immediate reenlistments. Reenlistments within 
2 to 90 days, reenlistments after 90 days, returns to military control. 
Reserve Components, and administrative gains accounted for the remaining 
14 percent. 

Design specifications were developed for methods of projecting each 

* 

of these gains categories except Reserve Components. However, in con¬ 
sideration of both the significance of the category and the estimated 


A discussion of these design specifications is contained in the 
Fhase I Report of the study, "ELIM-COMFLIF System Improvement." 
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level of effort required to develop the forecasting capability, the 
Study Advisory Group (SAG) directed that development be limited to 
programs associated with the two most numerous types of gains, NFS 
and immediate reenlistments. 


Table 1 

CATEGORIES OF ENLISTED GAINS 


1 

._Category_ __ j 

Monthly average 
in FY75 

Percent of 
total 

No-prior-service (NFS) 

15,806 

62.5 

Immediate reenlistment 

5,852 

23.1 

Reenlistment within 2-90 days 

192 

0.8 

Reenlist after 90 days 

1,493 

5.9 

Returns to military control 

1,704 

6.7 

Reserve components 

112 

0.4 

Administrative gains 
(all other) 

145 

0.6 

Total 

25,304 

100 


Figure 1 is a system schematic depicting conceptually the extension 
of the system to include a Gains Module. Inputs to the module come from 
the ELIM data base, augmented by some additional data from the Modem 

it 

Volunteer Army (MVA) master file. Outputs of the Gains Module are input 
to the ELIM Inventory Projection Module ClPH) and COMPLIP. 

A previous GRC study, titled “Evaluation of Army Manpower Accession 
Programs (EAMAP)," developed a nonlinear regression system to forecast 
Army volunteers and to compute the corresponding seasonal coefficients— 
i.e., the factors reflecting the seasonal patterns of enlistments of 
various groups—e.g., high school graduates and non-graduate mental groups 
1, 2, and 3. The system, which is operational and has been used success¬ 
fully, has been adapted for incorporation into the Gains Module for use 
in forecasting categories of RA NFS gains that are supply-limited. 

* 

This file was developed by another GRC study. 
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Projections of those NFS gains that are demand-limited—such as perhaps 
non-graduate mental group 4s—can be determined by means of the mechan¬ 
isms provided in C0MPLIP-G2. 

The nonlinear regression is based on historical time series data 
on enlistments and related variables. Forecasting volunteers in a no¬ 
draft environment, based in part on historical data when the draft was 
in effect, requires that the historical data that are used be restricted 
to true volunteers. Several methods have been developed to separate the 
enlistments during the period of the draft into draft-induced and true 
volunteers. The so-called ”GRC maximum" method (to be discussed later) 
is being used to estimate the true volunteers. 
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Chapter 2 


SYSTEM DESCRIPTION 


The function of the NPSGM is to provide a means by which forecasts 

of RA NFS gains can be made and the corresponding seasonal coefficients 

can be computed. The seasonal coefficients are factors reflecting the 

seasonal patterns of enlistments of various groups—e.g., high school 

graduates, and non-graduate mental groups I, II, and III. The method 

used is that of nonlinear multiple stepwise regression. This method 

4 

was developed by GRC in a previous study. It consists of a nonlinear 
portion used to determine seasonal factors and a linear portion used to 
determine the regression coefficients for the independent or explanatory 
variables reflecting policies, programs, and economic and other environmental 
conditions. The linear portion uses the BMD02R linear multiple stepwise 
regression program taken from the statistical package compiled by W. J. 

Dixon of the University of California in Los Angeles (UCLA). The package 
is known as the UCLA Biomedical Computer Programs.^ 

The procedure for forecasting RA NPS gains consists of four major 
steps. These are shown schematically in Fig. 2. The first step is the 
development of historical time series of 250 categories of RA true volvin- 
teers defined on the basis of such characteristics as sex, civilian 
education, and race. The population breakouts are maintained for subse¬ 
quent regression runs. The second step-is to prepare the regression input 

^Grissmer, D. W., et al, "An Evaluation of Army Manpower Accession 
Programs," General Research Corporation, April 1974. 

^W. J. Dixon, "Biomedical Computer Programs," University of California 
Publications in Automatic Computation No. 2, University of California 
Press, 1970. 
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Note: - - - ^ Indicates control flow 



























by aggregating the detailed population breakouts into the desired set 
of subgroups, adding the time series of the selected explanatory (or 
independent) variables, and compiling control cards for the regression 
program. The third step is to exercise the regression model in order 
to produce the forecasts for each of the desired set of population sub¬ 
groups. The fourth step is to aggregate the forecasts of the subgroups 
into at most four groups referred to as characteristic groups or C-groups 
used by COMPLIP, compute seasonal coefficients for each of the C-groups 
based on the aggregated forecasts, and produce printer graphs of the 
historical time series and of the time series computed by the regression 
equation for both the historical time frame, and 12 months of forecast. 
Graphs are produced for each of the population categories for which a 
regression run was made. 

DEVELOPMENT OF HISTORICAL DATA 

The true volunteers during the draft era are estimated using the 
"GRC maximum" method. This method stems from the relationship between the 
number of enlistees during the draft era and the lottery sequence numbers 
(LSNs), as shown in Fig. 3. The graph confirms the conviction that many 
of the enlistees volunteered because of draft pressure. Note the leveling 
off of the graph after LSN 240. This characteristic formed the basis for 
concluding that volunteers with LSN greater than 240 were true volunteers, 
since they were in little or no danger of being drafted. 

A precise formulation of the estimating relationship is given by the 
following equation: 

+ D^q + C366/126) (E + D) (1) 

where equals the estimated number of true volunteers, E^^^equals the 
number of RA volunteers without LSNs, D^^ equals the ninnber of volunteers 
for the draft without LSNs, E equals the number of RA volunteers with 
LSNs 241 to 366 and D equals the number of volunteers for the draft with 
LSNs 241 to 366. The rationale behind this estimating relation is as 
follows. Volunteers with LSNs in the range 241 to 366 are considered to 
be true volxinteers. The number of LSNs in this range is 126, while the 
total number of LSNs is 366. The ratio 366/126 is used in Eq 1 for the 
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following reasons: (a) since LSNs are drawn randomly, individuals with 
LSNs 241 to 366 are a random sample of all individuals with LSNs; and 
(b) the total expected number of true volunteers of individuals with LSNs 
includes a proportionate number with LSNs in the range 1 to 240. All 
enlistees without LSNs are assumed to be true volunteers. 

The estimating procedure is xised to tabulate true volunteers for 
each month through June 1973. After June 1973 the accessions are tabulated 
directly. The frequency tabulations are made for each cell of two popula¬ 
tion partitions, given in Table 2, for each month for which historical data 
exists. ThuSj a time series is formed for each population cell. Partition 
1 includes categories 1 through 196, where the breakouts are based on sex, 
civilian education, race, mental group, age, and bonus. Partition 2 includes 
categories 197 to 250, where the breakouts are based on term of service, 
civilian education, race, and mental group. Both partitions are mutually 

exclusive and exhaustive partitions of the population. The sources for 

* ** 

these data are the MVA master file for the draftee true volunteers and 
the ELLM-III cohort file for the EA true volunteers. Table 3 gives the MVA 
master file format and Table 4 gives the ELIM-III cohort file format. Monthly 
additions to the volunteer time series data base can be made by means of an 
update program that uses the Gains transaction file as input. The format 
of the Gains transaction file is given in Table 5. 

REGRESSION INPUT 

The second step in this forecasting procedure is to prepare the input 
for the regression program. This is done by means of a regression input 
generator that produces a file containing control cards for the regression 
program, and the time series for each dependent (i.e., volunteer category) 
and independent or explanatory variable to be used in the projection process. 
In order to form the time series for the desired volxmteer categories, the 
user specifies the categories of Table 2 to be aggregated. The aggregations 
are based either on partition 1 or partition 2, not both. 

This file was created mder a previous GRC study. 

Extracting these data from the MVA Master File is a one-time operation. 
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CATEGORIES FOR NFS GAINS FREQUENCY DISTRIBUTIONS 
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Volunteers for: draft and RA 










Table 3 


MVA MASTER FILE FORMAT 


I. MVA MASTER FILE 

A. Density - 800 BPl 

B. Mode - BCD 

C. Record size - 120 characters 

D. Blocking factor - 40 records/block; 4800 characters/block 

E. Label records - none 

F. Parity - even 


DATA ELEMENTS 





Variable name 

Chars 

Pos 

Ty^e 

Comments 

1. 

5-digit ZIP code 

5 

1-5 

N 

No blanks 

2. 

Date of birth ^ 

6 

6-11 

N 

No blanks 

3. 

Branch of service 

1 

12 

N 

From ACC or DEP- 
Branch-Service 

4. 

Lottery catego^ 

Lottery number' 

1 

13 

N 


5. 

3 

14-16 

N 

Zero fill 

6. 

Term of enlistment 

1 

17 

A 


7. 

Enlistment option code 

4 

18-21 

A 


8. 

Type entry 

1 

22 

N 

If ACC date=0, 
then 9 

9. 

Accessiog date 

6 

23-28 

N 

Zero fill 

10. 

DEP date 

6 

29-34 

N 

Zero fill 

11. 

AFQT category 

1 

35 

N 

Zero fill 

12. 

AFQT score 

3 

36-38 

N 

Zero fill 

13. 

Race 

1 

39 

N 

1-7 

14. 

Education 

1 

40 

A 


15. 

Number of dependents 
(not counting self) 

1 

41 

N 

0-9 

16. 

Last service 

1 

42 

A 


17. 

Sex 

1 

43 

A 

M or F 

18. 

State 

2 

44-45 

A 


19. 

Blank 

3 

46-48 

A 


20. 

Blank 

2 

49-50 

A 


21. 

Marital status 

1 

51 

A 


22. 

Lottery location 

1 

52 

N 

1 = 000-60 

2 = 61-120 


3 = 121-180 

4 = 181-240 

5 = 241-300 

6 = 301-366 
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Table 3 (cont'd) 


Variable name 

Chars 

Pos 

'Type 

Comments 

23. RMS 

3 

53-55 

A 


24. Training commitment 

4 

56-59 

A 


25. Waiver 

1 

60 

A 



26. 

SSN 

9 

61-69 

N 

Specifications on temporary 
SSN are currently a mystery 

27. 

Status code 

1 

70 

A 

Extracted from DEP or ACC 
status code. 

00 

CM 

AQB 

30 

71-100 

N 

999 = no test scores avail¬ 
able, Jan 70 through Mar 73 
use psn 71-91. 

29. 

Recruiter code ^ 

7 

101-107 

A 


30. 

Transaction date 

6 

108-113 

N 

Year/month/day either ACC 
date or DEP date. 

31. 

Age 

2 

114-115 

N 

Attained age in years (not 
rounded) at DEP or ACC date 

32. 

Religion 

2 

116-117 

A 

ACC date. 

33. 

Blank 

3 

118-120 

A 



Validity checks have been performed on these fields. 
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Table 4 


:k 

ELIM III COHORT FILE FORMAT 


Characters 


SSAN or TIN 

1 - 9 

Filler 

10 - 10 

Date of birth 

11 - 16 

AFQT score 

17 - 18 

Sex 

19 - 19 

Race 

20 - 20 

Term of enlistment 

21 - 21 

Training commitment 

22 - 25 

Civilian education 

26 - 26 

Mental category 

27 - 27 

Enlistment options 

28 - 31 

AQB score summary 

32-41 

AQB category 

42 - 42 

Lottery number 

43 - 45 

Number of dependents 

46 - 46 

Marital status 

47 - 47 

Moral waiver 

48 - 48 

Age in months 

49 - 51 

Number of transactions 

52 - 53 

Date of transaction 

54 - 57 

Type of transaction* 

58 - 60 


131 - 134 


135 - 137 


Code description 


(YYMMDD) 

99-invalid 

M, F, 9-invalid 

1-white, 2—black, 3-other; 

9-invalid 

9-invalid 

4 char MOS (not edited) 

9-invalid; 0 to 8-no HS; A,B,C,D- 
some HS; Y or F-GED; other-HS 
graduate 

1,2,3,4,5; 9-invalid 

1st char: 1-combat arms, 2-service 

schools, 3-other, 4-RA (unassigned) 

option; 

Is t char: B-bonus 

each character represents a test 

score 0—invalid; 1,2-score < 90; 

3,4 - (90,99); 5,6,7 - (100,109); 
8,9 - score = 110 

999-invalid 
9-invalid 


999-invalid 
01 - 12 
(YYMM) 

GHF - NFS gain 


The codes on this file have been checked for validity. 

*"'Date and type of transaction codes may be repeated up to a total of 
12 sets. 
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Table 5 


GAIN AND EXTENSION RECORD FORMAT 
File Identification; BE ME 0013 

Record Identification: Gains Transactions/Extensions Transactions 

DCSPER-46, Part II (Edit) 


Relative 

positions 

Name of data element 

No. of 
characters 

Type of 
characters 

1-15 

Name personnel 

15 

M 

16-17 

Sending PPA code 

2 

M 

18 

Blank 

1 

M 

19-20 

Service number prefix 

2 

A 

21-29 

Social security account number 

9 

M 

30-32 

Grade in which serving (abbr) 

3 

M 

33 

Grade in which serving 

1 

M 

34-36 

Months service for pay 

3 

N 

37-41 

Primary Military occupational specialty 5 

M 

42 

Race 

1 

A 

43 

Service component 

1 

A 

44 

Term of service or enlistment 

1 

N 

45-46 

Current assignment 

2 

M 

47-49 

Blank 

3 

M 

50 

Previous regular Army service 

1 

M 

51-52 

Year service code Generated 

2 

M 

53-56 

Expiration of term of service (yr mo) 

4 

N 

57-60 

Parent unit and morning report indicator 4 

M 

61-62 

Sub unit code 

2 

M 

63-64 

Type transaction 

2 

M 

65-70 

Transaction date (yr mo da) 

6 

N 

71 

Assignment code Generated 

1 

M 

72 

ETS PETS code Generated 

1 

N. 

73 

Reception station code 

1 

M 

74-76 

Separation program number (previous) 

3 

M 

77-80 

Process yr mo Generated 

4 

N 

81-82 

Category code Generated 

2 

N 

83-86 

Basic active service (yr mo) 

4 

A 

87-88 

Status 

2 

M 

89 

Civilian education code 

1 

M 

90 

Mental group 

1 

N 

91-92 

Armed forced qualification test score 

2 

N 

93-98 

Date of birth (yr mo da) 

6 

N 

99 

Sex 

1 

A 

100-101 

Receiving PPA code 

2 

M 

102-105 

Enlistment options 

4 

M 

106-109 

Basic pay entry (yr mo) 

4 

N 

110 

Dual service component status 

1 

A 
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Table 5 (cont'd) 


Relative 

positions 

Name of data element 

No. of 
characters 

Type of 
characters 

111 

Special personnel category 

1 

A 

112-115 

ETS of previous service (yr mo) 

4 

N 

116 

Propay 

1 

N 

117-130 

Blank 



131-132 

PMOS evaluation score 

2 

N 

133 

Overseas/CONUS code 

1 

M 

134 

Military personnel class 

1 

A 

135 

Record mark 

1 

M 


^An A means alphabetic, N means numeric; M means that the field may 
contain any characters. 
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The NPSGM permits the user to make aggregations for a maximum of 
20 volunteer categories that are used as dependent variables* It is 
not necessary that these aggregate categories be either mutually exclusive 
or exhaustive. The user will be provided with a message listing any 
duplicated or omitted categories. However, for the purpose of generating 
input for the ELIM-COMPLIP System, it is necessary that the set of dependent 
variables be mutually exclusive. On the other hand, they will not in 
general be exhaustive, since regression will be used to project only those 
NFS gains categories that are supply limited. An example of a mutually 
exclusive but not exhaustive set of dependent variables is given in Table 6. 

Table 6 

EXAMPLE OF CATEGORIES FOR WHICH REGRESSION RUNS 

CAN BE MADE 

(Aggregated from Population Partition I) 

1. Females 

2. Male, HSG, Black, Cat I, II, III 

3. Male, HSG, Black, Cat IV, V 

4. Male, HSG, Not Black, Cat I, II, III 

5. Male, HSG, Not Black, Cat IV, V 

6. Male, NHSG, Black, Cat I, II, III 

7. Male, NHSG, Not Black, Cat I, II, III 

The forecasts for each dependent variable are based on selected 
independent variables. Examples of independent variables are military pay, 
unemployment rate among the 16-21 year old male out-of-school labor force, 
and advertising expenditures for accessions. The file of independent vari¬ 
able data shown in the schematic of Fig. 2 contains historical values for 
a set of independent variables that have proved useful in the projection 
of volunteer enlistments in other GRC studies. A list of such variables 
is shown in Table 7. Historical values of each of the independent variables 
have been obtained from previous GRC studies. Values of these variables 
must be kept current by the user. For the 12 months for which forecasts 
will be made, the user has the option of specifying the values for the 
corresponding independent variables or of having the program extend the 
last available value to the end of the projection period. 
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Table 7 


DEFINITIONS OF INDEPENDENT VARIABLES 

^LQY A dummy variable that is set during the time period when 

recruiter credit was not given for Cat III non—high 
school enlistees and is 0 otherwise. The phasedown of^ 
the values represent the gradual withdrawal of the policy. 

AQOTDM The variable is the number of males to be recruited by 

the Army. The quota is set by the Department of the Army. 

BNSHSG A dummy variable that is set to 1.0 each month in which 

the $1500 enlistment bonus is in effect for high school 
graduates. 

BNSINC A dummy variable that is set to 1.0 each month in which 

the $1000 incremental bonus is in effect for enlistees. 

BNSKLI* A dummy variable that is set to 1.0 for May and June 1973 

and June 1974 and subsequent months in which a bonus was 
in effect for skills. 

BNSKL2 A diimmy variable that is set to 1.0 each month beginning 

in Jxane 1974 in which a bonus is in effect for skills. 

BNSNHS A dummy variable that is set to 1.0 each month in which 

the $1500 enlistment bonus is in effect for non-high school 
graduates. 

CAOPTS Niimber of combat arms options available. 

CAT4LM The percent of NPS enlistments permitted as mental 

Category IV. 

DUNEMP Deseasonalized imemployment rates for 16-21 out-of-school 

males. 

HSPL74 A dummy variable that is set to one when maximum HS grad 

policy is in effect. The withdrawal of the policy is 
modeled by a ramp since a period of time is required 
for the recruiter to develop the non-HS market again 
(in test set equal to 1.0 in April 73 with a ramp before 
and after this month). 

It is recommended that the bonus for skills for June 1974 and subse— 

quent months be reflected by BNSKL2 only and the corresponding values for 

BNSKLl be set to zero. 
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Table 7 (cont*d) 


MICIPA 


OPTSTO 


PAIDTV 


PASURG 

PRTMED 

RECASS 

RECR 

TIME 

TYOPT 

TYOPTN 

UOCCAN 


The ratio of military RMC (regular military compensation) 
for grade E^-l to the civilian average weekly wages for 
two industries—Wholesale and Retail Trade Services, 

The number of combat arms and service schools options, 
it measures the number of new separate options avail¬ 
able to an incoming recruit to the Army, 

This variable measures the number of paid TV advertise¬ 
ments sponsored by the Army during their paid TV and 
radio advertising campaign. Source for the data is 
"Effectiveness of the Modem Volunteer Army Advertising 
Program," prepared by Stanford Research Institute for 
OSAMVA, December 1971. 

Dummy variable designed to reflect the impact of a new 
pay raise. 

The number of media insertions placed in national circu¬ 
lation magazines plus the number of national newspaper 
campaigns. 

Number of Army recruiter assistants assigned to Hometown 
Recruiter Assistant Plan each month. 

Number of Army recruiters on production each month. 

A variable measuring increasing time, 

A dummy variable reflecting when options are available 
for two year enlistees who are high school graduates. 

A dummy variable reflecting when options are available 
for two year enlistees who are non-high school graduates. 

Number of Army unit of choice canvassers. 
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Sometimes it is desirable to use independent variables that have been 
lagged by a few months to reflect the possibility of a delayed effect on 
the volunteers. The program permits up to nine copies of each variable, 
incorporating the desired lag. Copies of a variable including the variable 
itself are designated a family. A limit can be imposed on the number of 
members of a family that can be in the regression solution at any one time. 
In practice, this limit has usually been one. 

Provision is made for a sign (i.e., plus, minus, or blank for either) 
to be associated with each independent variable to control the sign of its 
coefficient upon entering the regression equation. For most variables 
except time use of this option is desirable because the objective is not 
only to obtain a good fit of the equation to the historical data but also 
to obtain an accurate forecast 12 months into the future. Without sign 
control a good fit may be obtained but a trend may be established between 
independent and dependent variables that has an adverse effect on the pro¬ 
jected values. With sign control a logically sound model can be constructed 
that establishes the appropriate type of correlation—i.e., positive or 
negative—between dependent and independent variables. For example, the 
variable reflecting a bonus for high school graduates should have a plus 
sign when used with the dependent variable that represents high school 
graduates. An example with a negative sign restriction is the case of 
using the independent variable "high school policy 74" with dependent 
variables representing not-high school graduates. Since the policy was 
designed to maximize the intake of high school graduates, it tended to 
minimize the not-high school graduate accessions and hence the variable 
reflecting this policy is negatively correlated with the dependent vari¬ 
ables reflecting not-high school graduates. 

For each dependent variable a set of independent variables that are 
candidates to enter the regression equation must be specified. Examples 
of dependent and independent variable combinations are given in Table 8. 

The reason the user specifies the independent variables is to eliminate 
those that are not applicable for the given dependent variable. Some 
independent variables are obviovisly not applicable for a given dependent 
variable and experience may teach that certain other independent variables 

For example, it is obvious that the independent variable representing 
the combat arms bonus is not applicable to the dependent variable repre¬ 
senting females. 
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Table 8 


EXAMPLES OF DEPENDENT AND CORRESPONDING INDEPENDENT VARIABLES 


Independent variables 

Mnemonics 

Sign 

restric¬ 

tion 


Dependent 

variables 

HSG 

black 

HSG 

not black 

Not HSG 
black 

Not HSG 
not blacl< 

Military civilian pay ratio 

MICIPA 

+ 

X 

X 

X 

X 

Recruiters 

RECR 

+ 

X 

X 

X 

X 

Recruiters (-1) 

RECR 

4 * 

X 

X 

X 

X 

Recruiters (-2) 

RECR 

+ 

X 

X 

X 

X 

Recruiter assistants 

RE CASS 

+ 

X 

X 

X 

X 

Recruiter assistants (-1) 

RE CASS 


X 

X 

X 

X 

Recruiter assistants (-2) 

RE CASS 

+ 

X 

X 

X 

X 

Unit of choice canvassers 

UOCCAN 


X 

X 

X 

X 

Unit of choice canvassers (-1) 

UOCCAN 


X 

X 

X 

X 

Unit of choice canvassers (-2) 

UOCCAN 

+ 

X 

X 

X 

X 

Combat arms options 

CAOPTS 

+ 

X 

X 



Combat arms options (-*1) 

CAOPTS 

H- 

X 

X 



Combat arms options (-2) 

CAOPTS 

+ 

X 

X 



Bonus - HSG 

BNSHSG 

+ 

X 

X 



Bonus - NHSG 

BNSNHS 

+ 



X 

X 

Bonus - Increment 

BNSING 

4 - 

X 

X 



Bonus - Skills 

BNSKL2 

4 - 

X 

X 



Two year option (HSG) 

TYOPT 

4 - 

X 

X 



Two year option (NHSG) 

TYOPTN 

4 - 



X 

X 

I 

Total options 

OPTSTO 

+ 



X 

X 

Total options (-1) 

OPTSTO 

4 - 



X 

^ 1 

Total options (-2) 

OPTSTO 

+ 



X 

^ ! 

Pay surge 

PASURG 

4 - 

X 

X 

X 

X 

Cat IV limit 

CAT4LM 

4 - 

X 

X 


1 

i 

1 

Print media 

PRTMED 

+ 

X 

X 


1 

Unemployment 

DUNEMP 

+ 

X 

X 

X 

X 

Unemployment (-1) 

DUNEMP 

4 - 

X 

X 

X 

X 

Unemployment (-2) 

DUNEMP 

4 * 

X 

X 

X 

X 

High school policy 74 

HSPL74 

- 



X 

X 

Army policy 

APLCY 

- 



X 

X 

Time 

TIME 


X 

X 

X 

X 


Definitions of the independent variables are given in Table 7. 
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do not contribute to the regression equations of certain dependent 
variables. The obvious advantage of eliminating as many of the 
independent variables for each dependent variable as possible is that 
it reduces computer running time. Another advantage is that the regres¬ 
sion equation will be based on meaningful variables. Forecasts for 
12 months into the future can be expected to be more reliable when 
based on equations containing variables that have a logical relation¬ 
ship than when based merely on a statistical relationship. 

REGRESSION ANALYSIS 
Methodology 

In an earlier GRC study a nonlinear regression model was developed 
that was incorporated into the BMD02R program of the UCLA BMD computer 
package. As indicated previously, the nonlinear portion of the combined 
model is used to determine the multiplicative seasonal factors, while 
the linear portion uses a stepwise procedure to determine the coefficients 
of the independent variables in the regression equation. The analysis 
assumes the dependent variable time series is initially in unadjusted or 
seasonalized form and the independent variables are in deseasonalized 
form (e.g., deseasonalized unemployment rates). This, however, is not 
essential for the methodology of the model to hold. It is only desirable 
in the sense that the model computes seasonal adjustment factors for the 
dependent variable. The less seasoned variation of the dependent variable 
is explained by the independent variables, the more is explained by the 
seasonal adjustment factors. These seasonal adjustment factors should not 
be confused with the seasonal coefficients that are computed from the 
forecast values and forwarded to C0MPLIP~G2. 

The model is solved iteratively for the seasonal adjustment factors 
and the coefficients of the linear model. First, the seasonal coefficients 
are set to 1.0 and BMD02R is used to solve for the linear regression coef¬ 
ficients. These are then fixed and seasonal coefficients are determined 
by means of a least squares fit. The seasonal coefficients are then fixed 
and a new set of linear regression coefficients is determined. The pro¬ 
cedure is repeated until convergence is reached. A schematic of the proce¬ 
dure is shown in Fig. 4. 

•k 

See Reference 4. 
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INPUTS 



Convergence 

criterion 

satisfied 


OUTPUTS 


1. Deseasonalization factors 

2. Coefficients of the variables 
in the model 

3. Residual errors, standard 
error of estimate, coeffi¬ 
cient of multiple determina¬ 
tion, and correlation matrix 

4. Forecasts 


Fig. 4—Basic Cycle of the Deseasonalized Step Regression 
Model 











The stepwise procedure requires two user specified F statistics 
as tolerances for bringing independent variables into the solution 
and for removing them. Suppose and F^ are such tolerances, where, 

F ^ F^. Let F denote the F statistic of the variable with the 

12 V 

largest F statistic that is not in the solution. Then, if F^ > F^ 
and the variable satisfies the user imposed entering restrictions 
(i.e., sign restriction and limit on the number of members per family 
that can be in the solution at any one time), it is entered into the 
solution. Similarly, for removing variables from the solution, if 
F < F,, the variable is excluded from the solution. 

V 1 

The model requires the following user input: 

. 1. A dependent variable Y = col (y^, which is typically 

a time series with n representing the number of observed values. 

2. An independent variable data matrix X = [x. , an n by p matrix 

where p is the number of independent variables. Each coltunn vector repre¬ 
sents a time series of n observed values for an independent variable. 

3. The niimber of observations, n, of the (seasonalized) dependent 
variable Y. The number of observations should be more than one cycle, r 
(r = 12) of the seasonal period (say, at least two cycles). 

The model has the following mathematical form: 


-1 


” — 



a + X-. a + a^ + . .. + x- a 

0 11 1 12 2 Ip P 

-1 



• 

• j 

-1 ^ 

! 

! 

1 i 

a^ + X a + X ^ a + ... + x a 

0 21 2 22 2 2p p 

* 

s y 

__ n •'n _ 

1 1 

a +x.a +x-a„+,..+x a 
i_ 0 nl n n2 2 np p _ 


where 82 ^,are the reciprocals of the seasonal coefficients 

and aj^,a 2 ,.-.,a are the regression coefficients of the independent vari¬ 
ables. Both sets of coefficients are determined by the model. Since s^ 

is a seasonal coefficient it is assumed that s. = s.. for i = l, 2 ,...,n-r. 

1 i-hr 

The basic procedure used in solving for s^, i = l,2,...,n and the a^ , 
j = l,2,...,p is the method of least squares applied to the following 
function: 
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n 


min 

s., a. 
1 J 


3(s.i a ) = L 

J i=l 


-1 P 

s. y . - L X,. 

• iJ J 
j=o 


-12 


(3) 


As stated earlier, the solution procedure is iterative. For iteration k 

A — 1 A ^ A 

let y^(k) = (k) y^ and let x^(k) = S x^. a.(k). For k = 1, 

^ j=o ^ 

y^(l) = since initially s^^ is assumed to be 1.0 for all i, and 

3^(1) for j = l,2,...,p are determined by BMD02R. At iteration k, 

k > 1, the method of least sqtiares is used to solve Eq (4) for s^^(k), 

i = 1,2,...,r, where r is assumed to be 12 and s.*(k) = s.| (k) for 

1 1+r 


i = 1,2,...,n-r. 


min 3 
s"^(k) 



n 

L 

i=l 


^(k) y^ - x^(k-l) 


(4) 


where x^(k-l) is a known quantity having been determined by ordinary least 
sqxiares in iteration k-1. The equations to be solved for s.^(k) are obtained 
by taking the partial derivatives and setting them to zero. That is. 


^3i 

—^ = 0 
^ s^ (k) 

yields, for i = 1,2,...,r. 


(5) 



The second half of 
in computing y^(k) 


X. (k-1) + y X (k-1) + y.,., X „ (k-1) + 
1 _ i+r i+r _ 1+2 r i+2r 

2 2 

+ y., + y., o + ... 

•'i+r '^i+2r 


( 6 ) 


iteration k makes use of the newly computed s.^(k) 


and BMD02R is used to solve for the a.(k) in Eq (7) 
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2 


min 3of^- ~ ^ 

a.(k) J ' i=l 
J 


P 

^(k) - S X. S (k) 

^ A J "J 


The iterative process is assumed to converge when 


(7) 


I s^^(k) - s"^(k-l) 1 ^ .001 (8) 

for all i and 

1 l(k) - a^Ck-l) 1 ^ .001 for all j. 

If convergence is slow or if there is no convergence, the process is 
terminated at 25 iterations. Experience has shown that convergence is 
usually reached within 25 iterations. 

Since the BMD02R is a stepwise regression, only the independent 
variables that pass the F tolerance and meet the sign criterion are intro¬ 
duced into the regression equation. Generally, this means that most of the 
independent variables are not in the equation if the number made available 
is in the neighborhood of 15 or 20. One case was experienced at GRC where 
different sets of independent variables entered the stepwise regression 
on successive iterations. After 25 iterations, for such a case, the regres¬ 
sion equations will include only one set of the independent variables. It 
should be noted that this case is a rarity. 

Output 

Output of the regression computer program includes the time series of 
the historical and computed (using the regression equation) values of the 
dependent variable, the deseasonalization factors, the coefficients of 
the variables in the model, the forecasts, the residual errors, the 
standard error of estimate, the coefficient of multiple determination 
and the correlation matrix. The deseasonalization factors are those used 
by the model. They have been determined in conjunction with the coefficients 
of those independent variables in the equation. They are not the seasonal 
coefficients to be used by CCMPLIP. These are computed at a later step 
based on the forecast values. 
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The residual errors, standard error of estimate, and the coefficient 

of multiple determination measure the goodness of fit of the regression 

equation to the historical data. For example, the square of the coefficient 

2 2 

of multiple determination, R , 0 ^ R ^1, measures the variation in the 

dependent variable that is explained by the regression equation. The closer 
2 

R is to 1.0, the better the fit of the regression equation. Typical values 
2 

for R are in the neighborhood of .90. The definition is as follows: 

= 2 0 ^^esidual sum of squares 
total sum of squares 

The standard error of estimate, SE, is defined by 

>2 _ residual sum of squares 

residual degrees of freedom 

The standard error of estimate is a relative measure since its magnitude 
varies with the magnitude of the dependent variable. The residuals should 
be checked for sequences with the same sign, indicating some unexplained 
variation perhaps by the omission of some variable, such as a policy or 
program variable. 

In the event an independent variable that is expected to be included 
in the regression equation is not included, the reason may be that it is 
highly correlated with another independent variable that is in the equa¬ 
tion. This may be determined by examining the correlation matrix. Another 
reason for a variable’s exclusion may be that it has a sign restriction 
that prevents it from entering the regression equation. 

The output of the preprocessor program lists the independent variables 
(including copies of independent variables that have been lagged) that the 
user has made available for each dependent variable. The beginning of the 
regression output gives a similar list but with the dependent variable 
included as the first variable. Since the dependent variable is not 
included in the preprocessor list, the index of an independent variable 
in the regression output list is one more than the corresponding index in 
the preprocessor list. The user should take note of this when making cross 
reference checks. 
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The regression statistics are printed for the first iteration of 
the solution procedure and again for the last. The statistics of the 
last iteration are of greater interest since this is the final solution. 
Statistics are printed for each step of the linear regression. The last 
step is followed by a "Summary Table" that documents the change in the 
regression equation at each step. This is followed by a table entitled 
LIST OF RESIDUALS. The five columns listed in this table are entitled 
CASE, YDEP, YHAT, RESIDUAL, and PCTERR. 

The column entitled CASE lists the index number of the time series. 

The index 1 corresponds to the first month represented in the regression. 

For example, if the first month of the data base is January 1971 and the 
maximum months of lag input to any of the independent variables is 2, 
then the index 1 corresponds to the month of March 1971 rather than 
January. The last 12 months of data listed corresponds to the projection 
period and is meaningless in this report except perhaps for the YHAT which 
are computed by the regression equation using only the linear coefficients. 

The column entitled YDEP lists the historical time series of the 
dependent variable. Each of these quantities has been divided by the 
appropriate seasonal adjustment factor (i.e., the left hand side of 
Eq (2). The last 12 values are the exception. These are obtained by 
extending the last historical value for 12 months without dividing by 
the seasonal adjustment factors. The column entitled YHAT represents 
the values computed by the linear portion of the regression equation 
(i.e., the right hand side of Eq (2)). The RESIDUALS are defined as 
YDEP - YHAT and the PCTERR as RESIDUAL/YDEP. 

The next table that is printed out is entitled MULTIPLICATIVE 
SEASONALS. This table has the following five-column heading: CASE, 

YDEP, YHAT, RESIDUAL, and SEASONAL. The CASE column is defined as 
before. The definitions of YDEP and YHAT are slightly different from 
those in the previous table. The column YDEP consists of the time series 
of the actual historical values of the dependent variable (without division 
by the corresponding seasonal adjustment factor as was the case in the 
previous table). The YHAT column consists of the product of the appropriate 
seasonal adjustment factor (for each month) and the monthly value computed 
by the linear portion of the regression equation. The SEASONAL column 
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consists of the seasonal adjustment, factor for each historical month 
that is computed by the nonlinear portion of the regression model. The 
RESIDUAL column is also defined as before. 

Unlike the previous table, the range of this table does not include 
the 12 future months. The forecast values are given in the next table 
entitled FUTURE TIME STREAM with the three column headings CASE, FORECAST 
and SEASONALS. The CASE and SEASONAL columns are defined as before, the 
FORECAST column contains the 12 months of seasonally adjusted projected 
values. It is these forecast values that are used in the post processor 
programs and subsequently in C0MPLIP-G2. 

Sample Regression Results 

The results discussed here are taken from the NPSGM system test rxm. 

The main purpose of the test run was to test the computer programs rather 
than to provide evidence of the quality of forecasts of NPS gains. Exten¬ 
sive analysis and experimentation with the model was not possible because 
of budget constraints. The test results should be considered illustrative 
of the model's capability rather than representative. 

Although the volunteer data that were available extended through June 
1974, only data through December 1973 were used and projections for GY 1974 
were made. The reason for this is that the model should only be used to 
project supply limited enlistees. For several months, beginning with December 

ik 

1974, the volunteers were enlisted on a demand limited basis, making it a 
poor time interval in which to test the model. Therefore, the forecasts for 
December 1974 were greatly overprojected by the test run and have been 
excluded from the statistics shown in Table 9. 

Two regression runs were made for each of the three population groups 

2 

shown in Table 9, one for blacks and one for non-^blacks. The R coefficient 
measures the goodness of fit of the regression results to the historical 
data. Values in the neighborhood of .9 are generally considered satisfactory. 
Both, the mean errors and the mean absolute errors are computed on the basis 


For future applications of the model where the demand limited accessions 
are a part of the historical data, one or more special independent variables 
(perhaps based on quotas) should be introduced to help explain the enlist¬ 
ments during this period of departure from supply limitation. 
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Table 9 


TEST RESULTS 
(Excluding December) 


Population group 

(Black) 

R^ 

(Not black) 

% Mean 

error 

% Mean 

Absolute error 

1. Male NHSG, Mental Category .97 

I, II, III 

.93 

4.4 

11.0 

2. Male HSG, Mental Category 
I, II, III (Case 1) 

.89 

.83 

-10.2 

12.0 

3. Male HSG, Mental Category 
I, II, III (Case 2) 

.92 

.84 

- 7.6 

10.0 


of the first 11 months of the projections. The mean error reflects the 
cancellation of positive and negative error terms, whereas the mean absolute 
error is based on the magnitude of the errors without regard to sign. 

Population Groups 2 and 3 are identical; however, the regression used 
different sets of independent (or explanatory variables). The time variable, 
defined in Table 7, was included for Group 3 and played an important role 
in entering the regression equation, whereas, it was omitted from Group 2. 
Notice that it made an improvement in the forecasts. Additional improvements 
could very likely be achieved by further analysis and experimentation with 
the model. 

Table 10 compares the forecasts with the actual values for the population 
Group 1 defined in Table 9. Note the poor forecast for the iKinth of December 
1974. As was stated earlier, the reason for this is that December 1974 was 
the beginning of the period in which accessions were demand limited rather 
than supply limited. Whenever large deviations such as are shown for the 
months of January and July are projected, then further analysis should be 
undertaken to try to determine the causes. For example, an independent 
variable may need to be constructed to reflect the result of a program or 
policy decision that would explain these deviations. 

It should be stated that proper use of the model, requires analytical 
expertise in (1) interpreting and analyzing the statistical results such 
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Table 10 


COMPARISON OF FORECASTS WITH ACTUALS FOR CY74 


NFS Males, NHSG, Mental Category I, II, III 



FORECAST 



Difference 

Forecast 

-Actual 


Month 

Case 6 
Black 

Case 7 

Not Black 

Total 

Actual 

% 

Difference 

Jan 

2198.6 

6541.9 

8741 

7056 

1685 

23.9 

Feb 

1484.0 

4670.8 

6155 

5910 

245 

4.1 

Mar 

1995.0 

4026.0 

6021 

5777 

244 

4.2 

Apr 

1720.8 

3713.0 

5434 

5765 

-331 

-5.7 

May 

2044.1 

3116.4 

5161 

5477 

-316 

-5.8 

Jun 

3199.3 

5454.5 

8654 

7470 

1184 

15.9 

Jul 

2016.6 

5099.6 

7116 

5768 

1348 

23.4 

Aug 

2245.9 

4476,6 

6723 

7076 

-353 

-5.0 

Sep 

2210.5 

3859.5 

6071 

7405 

-1334 

-18.0 

Oct 

2047.5 

5484.1 

7532 

6818 

714 

10.5 

Nov 

1689.4 

4911.8 

6601 

6536 

65 

1.0 

Dec 

1484.5 

4748.0 

6233 

2959 

3274 

110.6 

Totals 
& Mean 
Error 

24,338 

56,104 

80,442 

74017 

6425 

8.7 

Totals and Mean Error 

Excluding December 

74,208 

71058 

3150 

4.4 

Mean Absolute Error 



7819 

11.0 
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as: elasticity, tests of statistical significance, collinearity, 

2 

residuals, coefficient of multiple determination (R ), and standard 
error of estimate; (2) formulation of explanatory variables based on 
economic conditions, policy decisions and programs, and (3) use of 
such variables. 

POST PROCESSOR 

One of the post processor programs aggregates the monthly forecasts 
into at most four population categories used in COMPLIP. Monthly fore¬ 
casts for the supply limited groups can be combined to form data for 
COMPLIP constraints on annual availability. Since the forecasts are 
^ based on several years of historical data they are used as the basis 

for computing the seasonal coefficients used by COMPLIP by simply noi- 
malizing the first 12 months of the forecast values. Suppose for a 
/ given COMPLIP C-group the forecasts are denoted by , ^ 2 ’* *'*^12 

the corresponding seasonal coefficients by s,3^,. . . ,s ^2 • The computation 
is as follows: 

V. 

®i ^ ""12 ’ ^ " 1,2,. ..,12. 

S V 

j=l ^ 

Another function of the post processor is to produce printer plots of 
/ historical and forecast values of each of the dependent variables. This 

is accomplished by the graph program. The preprocessor and the regression 
routines generate the files of the data to be plotted. 
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Chapter 3 


USAGE 


This chapter is oriented towards the application of NPSGM and hence 
contains such information as input card and file descriptions and operat¬ 
ing procedures. The NPSGM consists of six programs defined as follows: 


Program #1 - 

Program #la - 
Program #2 - 

Program #3 - 

Program #4 - 

Program //5 - 


Data generator, used for initial creation 
of the frequency file. 

Frequency file update routine, used for 
updating an existing frequency file. 
Regression preprocessor, used for preparing 
regression inputs, and header data for the 
graph and aggregation files. 

Regression program. 

Graph program, used for developing printer 
graphs of historical and projected population 
time series. 

Aggregating program, used for aggregating 
projected population groups to form C-groups 
used in C0MPLIP-G2 and compute seasonal 
coefficients. 


The files in the schematics for each of the program s are denoted 
as tape files, although they may be disk files. For example, if logical 
unit 10 is a disk file, it is commonly referred to as TAPEIO. Each file 
has an MT (magnetic tape) identifier for ease in cross referencing. Table 
11 gives a summary of the files, identifying the creating program, the 
using program, the logical unit, and the MT identifier. Table 12 summarizes 
the file characteristics and approximate volume of data to be expected. The 
files are identified with the MT identifier used in Table 11 or the program 
schematics presented for each program description. 
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Table 11 


FILE SUMMARY 


File description 

-r 

Action 1 

Program // 1 

Logical 

unit 

MX 

identifier 

MVA file 

used by 

1 

1 

MTl 

ELIM III Cohort file 

used by 

1 

2 

MT2 

Frequency file 

created by 

1 

3 

MT3 


alternatively 
created by 

la 

2 

MT5 


used by 

la 

1 

MT3 or MT5 


used by 

2 

1 

MT3 or MT5 

Gain/loss transaction 
file 

used by 

la 

4 

MT4 

Independent variable 
data 

used by 

2 

10 

MT6 

Regression input file 

created by 

2 

7 

MT7 


used by 

3 

7 

MT7 

Regression control cards 

created by 

2 

50 

MTS 


used by 

3 

5 

MTS 

Scratch file 1 

used by 

3 

1 

MT9 

Scratch file 2 

used by 

3 

2 

MT9a 

Aggregation file 

created by 

2 

15 

MTIO 


augmented by 

3 

15 

MTIO 


used by 

5 

2 

MTIO 

Graph file 

created by 

2 

20 

MTll 


augmented by 

3 

30 

MT12 


used by 

4 

1 

MT12 

Aggregated C-groups 

created by 

5 

10 

MTl 3 

Graphs (to be copied 
to printer) 

created by 

4 

10 

MT14 
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Table 12 


SUMMARY OF FILE CHARACTERISTICS 


MT identifier 
of file 

COBOL/ 

FORTRAN 

Binary/ 

BCD 

Maximum 

record' 

length 

Fixed/ 

variable 

* 

Maximum 

size 

** 

MTl 


BCD 

60 

F 


MT2 

C 

BCD 

137 

V 

137,000,000 

MT3 

C 

BCD 

9 

F 

125,000 

MT4 

C 

BCD 

135 

V 

13,500,000 

MTS 

C 

BCD 

9 

F 

125,000 

MT6 

F 

BCD 

30 

F 

50,000 

MT7 

F 

BCD 

10 

F 

50,000 

MTS 

F 

BCD 

80 

F 

50,000 

MT9 

F 

BIN 

6 

V 

1,000 

MT9a 

F 

BIN 

255 

V 

30,000 

MTIO 

F 

BCD 

10 

F 

5,000 

MTll 

F 

BIN 

10 

F 

5,000 

MT12 

F 

BIN 

132 

V 

5,000 

MT13 

F 

BCD 

60 

V 

3,000 

MT14 

F 

BCD 

132 

V 

200,000 


* 

Measured in number of characters for coded files and number of 
words for binary files. 

This file need not be processed by USAMSSA. 
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DATA GENERATOR (PROGRAM //I) 

The data generator is a COBOL based, batch mode program which 
creates a coded frequency file of non-prior-service volunteer data. A 
schematic of it is shown in Fig. 5. Input to this program is the GRC 
MVA data base, used to extract draftee data prior to June 1973, and the 
other input is the ELIM Cohort File, which provides volunteer data from 
January 1971 to the present.* This program was conceived to be run only 
one time, as another program serves to update the frequency file when 
new data become available, A flowchart of the program is given in Fig. 6. 

An internal data array, 250 x 50 (where 50 is a dimension that repre¬ 
sents the number of months of data that the program may store), provides 
temporary storage for volunteer accession counts until the end of the 
input data. The program tabulates series for each of 250 population 
categories defined in Table 2 of Chap. 2. The categories are partitioned 
into two sets. They are mutually exclusive and exhaustive within each 
set. The first set or partition consists of popiUation categories 1 - 196, 
the second 197 - 250. 

Data items, read from input, which make up the 250 different time-series 
are: sex, race, age, term of service, education, mental category and bonus 
received. Category numbers 1-4 are comprised only of females and ordered 
by civilian education, race and mental category. Category numbers 5—196 
include only males ordered by civilian education, race, mental group, age 
and bonus received. Category numbers 197-250 include both males and females 
ordered by civilian education, race, mental group and term of enlistment. 

The separate mention of category numbers 1-4 is solely to inform the user 
that these four cells contain only female accessions data that are tinder 
slightly different sequence arrangement than are the remainder of Parti- 
tion 1. For every valid record processed by programs #1 or //la, one 
entry is made in each of the two population partitions, so that both 
partitions reflect the same population• However, the populations may differ 
slightly for the two partitions because of the different data items defin¬ 
ing each of them and the possible omission of some records due to invalid 
codes. 


"program la is the update program to be discussed in the next section. 

When this processing occurred the cohort file contained data through 
June 1974. 
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Fig. 6—^Flowchart for the Data Generator—Program 


Dump statistics 
to printer on 
counts, etc. 



t 


Print report on 
data activity 
and time-series 
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Upon reaching end-of-file on the input data, a report summarizing 
the data processed is printed, along with a monthly count of the 250 
series. These same data, written to tape in time-series form, become 
the frequency file. 


Files 

Input: 

TAPEl GRC MVA accession file 

TAPE2 ELIM III Cohort file 


Output: 

TAPES Frequency file 

PRINTER Report output, statistics on lO counts and 
input data validation summary 

TAPEl . This is a coded file that contains accession data used to 
extract draftee information. It is of fixed record length with 40 records 
per block and 60 characters per record (MVA file, MTl). 

TAPE2 . This is a coded file containing RA enlisted data. It con¬ 
tains variable length records from 60 to 137 characters (ELIM-III cohort 
file, MT2). 

TAPES . Also a coded file, TAPES is the output frequency file having 
a fixed record length of nine characters per record, and 250 records 
per block (MTS). 

Printer , This file displays total of records read and written and 
data validity counts. It also gives a report on the number of accessions 
per month which are in each of the 250 series. 

The Job Control for CDC Cyber 70 for the Frequency File Generator is 
as follows: 

T2000, MTS Program #1 

COBOL CLR,D) 

REQUEST, TAPEl, R, S. 

REQUEST, TAPE2, R. 

* 

VSN, TAPEl = 2434/2712/0538/3771/2043/1268. 

'k 

VSN, TAPE2 = 2546/2571/2757. 

REQUEST, TAPE3, W, S, VSN = SAVE. FREQ FILE 

FILE (TAPEl, BT=K, RT=F, MBL=2400, FL=60, RB=40, CM=YES) 

* 

These are internal GRC type numbers. 
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FILE (TAPE2, BT=C, RT=Z, FL=137, CM=YES) 

FILE (TAPE3, BT=K, RT=F, MBL=2250, FL=09, RB=250, CM=YES) 

LDSET (FILES=TAPE1/TAPE2/TAPE3) 

LGO 

FREQUENCY FILE UPDATE ROUTINE (PROGRAM #la) 

This COBOL program, like the Data Generator, creates a monthly time- 
series of 250 elements, based on specific data fields from the gain/loss 
transaction file. A schematic of this program is given in Fig. 7 and a 
flowchart is given in Fig. 8. 

Inputs to this program are: (1) the gain/loss file, (2) the fre¬ 
quency file and (3) one parameter card. Outputs of the program are: 

(1) a new frequency file with new data applied and header records 
adjusted, and (2) a printer output file which displays statistics of 
the run and gives summaries of the data processed. The program has an 
internally dimensional array of 250 x 6, which allows input data to cover 
a six-month range. Data outside this range will be ignored. 

A parameter card specifies the last calendar month for which the fre¬ 
quency file is to be updated. For the normal update mode, this last 
update month is later than the last month of the frequency file prior to 
the update. However, it is possible to add data for prior months, with 
such late transaction data being combined with the corresponding data 
already on the file. If the date on the parameter card is more than six 
months beyond the last month of the frequency file, an error condition is 
assumed and the program terminates. Other possible combinations of data 
base and parameter cards are described in Fig. 9. 

Upon finding the parameter card date within the update capability 
of the program, as much of the original frequency file that will not be 
altered during the run is copied to the output frequency file. After 
processing of the data, the internal arrays are summed with the correspond¬ 
ing months of data on the original frequency file and copied to the output 
frequency file. 

The frequency file contains four header records which contain starting 
date, of the time-series and the length, in months, of the data base. During 
an update, corrections are made to the header to reflect the addition of new 

data. 
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Fig. 9—^Hypothetical Examples of Updates 
















A knowledge of the ending point of the existing frequency file is 
critical to a successful update. 

Files 

Input: 

TAPEl Frequency file (original) 

TAPE4 Gain/loss file 

CARD Parameter card (1) 

Output: 

TAPE2 Frequency file (newly generated) 

PRINTER Lists 10 operation covints, input parameter 
card list and new header information 

TAPEl . This is the coded frequency file generated by Program #1 
(Data Generator), (MT3); or by //la as MT5 in a previous run. 

TAPE2 . This file is identical to TAPEl, in format, but reflects 
changes in the header and total file length, due to updating. In addition 
to the adding of new months of data, earlier months may be different if 
late transactions are posted, (MTS). 

TAPE4 . Gain/loss file that is used to update the frequency file 
(MT4) . 

Printer . Contains information pertaining to the update, such as 
starting and ending times of the update period, and how many months of 
new data will be added. It also displays new values assigned to replace 
the header records of the frequency file. 

Input Card . A single parameter card identifies the last calendar month 
that the update will cover. The format is simply: Month/Year (12, IX, 12). 

All comparison against the input frequency file starting date is based 
on this date, and internal storage registers also are based on the accuracy 
of the date on this card. 

A typical example of a parameter card for a particular update is as 
follows: 

Data base length = 32, witli last month of data 
base being 06/75. 

Parameter card being of the form 10/75 would produce 4 months of new data 
and update 2 months of previously existing data, if any data exists for 


these two months. 
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REGRESSION PREPROCESSOR AND REGRESSION PROGRAMS (Programs //2 and #3) 

Both are FORTRAN programs. All the input files and control cards 
are supplied to the preprocessor program, which in turn prepares all the 
input for the regression program. In fact, in order to run the regression 
program, the preprocessor must first be exercised in order to prepare the 
input for the regression program. The two programs must be run under the 
same job control because one file (MTIO) that is initially written in the 
preprocessor is continued to be written by the regression program and there¬ 
fore must not be rewound at the start of the regression run. A schematic 
of the two programs is shown in Fig. 10. 

By use of input parameter cards, the user is able to select many com¬ 
binations of dependent variables from the frequency file, along with inde¬ 
pendent variables that help to explain variations in the dependent variable 
and attempt to find patterns so as to forecast these variables for the future. 

For each dependent variable selected, the user is able to select inde¬ 
pendent variables to be regressed against the dependent variable. This 
constitutes a single problem. A subproblem would include the use of the 
same dependent variables. A listing of the independent variable time series 
is given in App. A. 

The regression routine used in conjunction with the preprocessor is 
the nonlinear stepwise package described in Chap. 2. The BMD02R users 

■k 

manual can be referenced to help explain the listing of most of the input 
data, which is generated for the user with the preprocessor. 

Frequency File Arrangement and Aggregation 

The frequency file is comprised of 250 cells per month for as many 
months as the data base covers. These 250 cells are broken into two 
sets (or partitions) consisting of 196 and 54 categories, respectively. 

For any particular run, a maximum of 20 aggregations can be made 
from either of the partitions, thereby creating 20 dependent variables. 

Care must be taken to ensure that for a selected run, all choices of 
dependent variables come from the same partition. If any cells within 

k 

See Reference 5 of Chap. 2. 
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Fig. 10—Steps 2 and 3 in the Projection of NFS Gains 







the selected partition are omitted, an error message will be given, as 
will the case where a cell from the wrong partition is selected to form 
one of the dependent variables. Neither of these t\<fo error conditions 
will terminate the run, but messages will be given, listing the cells 
which were duplicated, omitted or included from the wrong partition. 

The preprocessor will only abort if the input card sequence is in error 
or if one of the cards specifying the selected cells is not in ascending 
order. (Refer to series card, type 3, for a further description of the 
error.) 

Refer to the detailed tables. Tables 13 and 14 of the frequency file 
layout, in order to select cells that are to be combined to form each 
dependent variable. 

File Description 

Preprocessor input consists of: 

1) TAPEl (MT3 or MTS) ELIM-COMPLIP frequency (dependent 
variable data) 


2) 

TAPEIO 

(MT6) 

Independent variable time-series data base 
to be maintained by means of the system 
update or edit features. 


3) 

TAPES 


User control cards. 

< 

Preprocessor output 

consists of: 


1) 

TAPES0 

(MTS) 

Regression control card file. 


2) 

TAPE? 

(MT7) 

Dependent and independent variable time- 
series . 


3) 

TAPE6 and 
System printer 

Printer file with all user control card 
activity, regression card image input 
data and statistics on data selected for 
each regression. 


4) 

TAPE20 

(MTll) 

Date and data structure information for 
dependent variable graphs. 

( 

i 

5) 

TAPEIS 

(MTIO) 

Data structure information for aggregations. 


Regression 

routine 

input: 

\ 

{ 

1) 



Items (1), (2), (4) and (5) under preprocessor 

j 


output using the same logical units except for 
TAPE 50, which is read as TAPES. 


so 


Table 13 


CATEGORIES FOR TRUE VOLUNTEER NFS GAINS FREQUENCY DISTRIBUTION 

(Population Partition 1) 


Female^ 


Category ■ 

number ( 

Civilian 

education 

Race 


Age_ 

Bonus 

1 

HSG^ 

black 

1,2 

all 

no 

2 

ft 

non-black 

t! 

tf 

ff 

3 

GED^ 

black 

If 

ff 

ft 

4 

ft 

non-black 

ff 

ft 

ff 


Male^^ 


Category- 

number 

Civilian 

education 

Race 

HKiSISi 

Age 

Bonus 

5 

HSG 

black 

1,2,3A 

<18 

yes 

6 

ff 

ft 

ff 

ft 

no 

7 

ff 

tf 

ff 

^18, <20 

yes 

8 

Tf 

ft 

ff 

ff 

no 

9 

ff 

tf 

tf 

^20, <22 

yes 

10 

ff 

tf 

tf 

ft 

no 

11 

tf 

It 

ff 

^22 

yes 

12 

tf 

ft 

ft 

tf 

no 

13 

HSG 

black 

3B 

<18 

yes 

14 

If 

ft 

ff 

ft 

no 

15 

ff 

ft 

ft 

^18, <20 

yes 

16 

ft 

ft 

tf 

ft 

no 

17 

If 

ft 

Tf 

2:20, <22 

yes 

18 

ff 

ft 

ff 

tf 

no 

19 

ff 

ft 

Tf 

^22 

yes 

20 

ft 

ft 

ft 

M 

no 
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Table 13(cont*d) 


Male^ 


Category 

number 

Civilian 

education 

Race 

Mental 

group 

Age 

Bonus 

21 

HSG 

black 

4,5 

<18 

yes 

22 

I! 

ft 

II 

II 

no 

23 

H 

II 

11 

^18,<20 

yes 

24 

II 

If 

II 

II 

no 

25 

II 

It 

If 

^20 ,<22 

yes 

26 

II 

II 

If 

ft 

no 

27 

11 

II 

If 

^22 

yes 

28 

If 

If 

fl 

If 

no 

29 

HSG 

not black 

1,2,3A 

<18 

yes 

30 

II 

ft 

II 

II 

no 

31 

II 

If 

II 

^18,<20 

yes 

32 

If 

If 

11 

If 

no 

33 

II 

If 

II 

^20 ,<22 

yes 

34 

It 

II 

If 

II 

no 

35 

11 

II 

If 

^22 

yes 

36 

II 

If 

If 

II 

no 

37 

HSG 

not black 

3B 

<18 

yes 

38 

It 

11 

If 

II 

no 

39 

II 

II 

If 

^18, <20 

yes 

40 

If 

ft 

If 

II 

no 

41 

If 

II 

If 

^20, <22 

yes 

42 

If 

II 

II 

II 

no 

43 

If 

II 

fl 

^22 . 

yes 

44 

II 

II 

ft 

II 

no 
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Table 13(cont'd) 


Male*^ 


Category 

number 

Civilian 

education 

Race 

Mental 

group 

Age 

Bonus 

45 

HSG 

not black 

4,5 

<18 

yes 

46 

?i 

II 

If 

fl 

no 

47 

IT 

II 

11 

^18, <20 

yes 

48 

IT 

IT 

If 

II 

no 

49 

IT 

II 

If 

2:20, <22 

yes 

50 

If 

If 

If 

If 

no 

51 

II 

11 

II 

-2.11 

yes 

52 

II 

II 

It 

II 

no 

53 

GED 

black 

1,2,3A 

<18 

yes 

54 

II 

If 

II 

If 

no 

55 

II 

II 

II 

^18, <20 

yes 

56 

II 

II 

If 

II 

no 

57 

II 

II 

IT 

^20, <22 

yes 

58 

II 

IT 

If 

If 

no 

59 

II 

II 

IT 

^22 

yes 

60 

IT 

If 

II 

fl 

no 

61 

GED 

black 

3B 

<18 

yes 

62 

fl 

IT 

If 

If 

no 

63 

II 

II 

fl 

sl8, <20 

yes 

64 


II 

It 

It 

no 

65 

IT 


II 

^20, <22 

yes 

66 

IT 

II 

If 

II 

no 

67 

If 

II 

If 

^22 

yes 

68 

II 

II 

If 

fl 

no 
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Table 13(Cont’d) 


Male^ 


Category 

number 

Civilian 

education 

Race 

Mental 

group 

Age 

Bonus 

69 

GED 

black 

4,5 

<18 


yes 

70 

If 

If 

If 

II 


no 

71 

If 

If 

If 

^18, <20 

yes 

72 

1! 

If 

II 

If 


no 

73 

If 

It 

II 

^20, <22 

yes 

74 

IT 

If 

If 

II 


no 

75 

n 

If 

If 

^12 


yes 

76 

If 

If 

II 

II 


no 

77 

GED 

not black 

1,2,3A 

<18 


yes 

78 

M 

If 

II 

II 


no 

79 

If 

II 

II 

^18, <20 

yes 

80 

If 

If 

II 

II 


no 

81 

If 

If 

II 

^20, <22 

yes 

82 

If 

II 

II 

II 


no 

83 

II 

If 

II 

^22 


yes 

84 

II 

If 

If 

II 


no 

85 

GED 

not black 

3B 

<18 


yes 

86 

If 

ft 

II 

II 


no 

87 

II 

II 

II 

sl8, <20 

yes 

88 

II 

II 

II 

IT 


no 

89 

II 

II 

II 

s20, <22 

yes 

90 ' 

It 

II 

II 

II 


no 

91 

11 

If 

IT 

^22 


yes 

92 

It 

It 

II 

II 


no 
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Table 13(cont'd) 


Male^ 


Category 

Civilian 


-1 

Mental 



number 

education 

Race 

group 

Age 

Bonus 


93 

GED 

not black 

4,5 

<18 

yes 

94 

If 

II 

II 

If 

no 

95 

fl 

If 

If 

sl8, <20 

yes 

96 

If 

I? 

11 

II 

no 

97 

M 

If 

II 

^20, <22 

yes 

98 

I! 

II 

II 

ft 

no 

99 

fl 

If 

If 

^22 

yes 

100 

It 

It 

fl 

II 

no 

101 

some HS 

black 

1,2,3A 

<18 

yes 

102 

If 

ft 

If 

If 

no 

103 

If 

If 

II 

^18, <20 

yes 

104 

II 

fl 

II 

fl 

no 

105 

If 

fl 

II 

s20, <22 

yes 

106 

If 

II 

It 

If 

no 

107 

If 

If 

n 

^22 

yes 

108 

II 

II 

II 

II 

no 

109 

some HS 

black 

3B 

<18 

yes 

110 

II 

II 

fl 

II 

no 

111 

IT 

II 


^18, <20 

yes 

112 

II 

II 

II 

II 

no 

113 

If 

If 

II 

^20, <22 

yes 

114 

If 

If 

ft 

II 

no 

115 

fl 

fl 

II 

^22 

yes 

116 

IT 

II 

It 

II 

no 
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Table 13 (cont'd) 


Male"^ 


Category 

number 

Civilian 

education 

Race 

Mental 

group 

Age 

Bonus 

117 

some HS 

black 

4,5 

<18 

yes 

118 

!l 

If 

II 

ff 

no 

119 

If 

If 

II 

^18, <20 

yes 

120 

1! 

If 

If 

If 

no 

121 

M 

11 

ff 

^20, <22 

yes 

122 

!l 

It 

II 

ft 

no 

123 

Tf 

II 

If 

^22 

yes 

124 

If 

II 

If 

If 

no 

125 

some HS 

not black 

1,2,3A 

<18 

yes 

126 

!? 

II 

If 

tf 

no 

127 

If 

II 

If 

^18, <20 

yes 

128 

II 

II 

If 

If 

no 

129 

II 

If 

If 

^20 ,.<22 

yes 

130 

II 

II 

ff 

If 

no 

131 

If 

If 

If 

^22 

yes 

132 

II 

If 

11 

If 

no 

133 

some HS 

not black 

3B 

<18 

yes 

134 

11 

ff 

II 

If 

no 

135 

IT 

ft 

It 

kl8, <20 

yes 

136 

II 

II 

II 

If 

no 

137 

If 

If 

II 

^20, <22 

yes 

138 

IT 

If 

II 

If 

no 

139 

It 

If 

It 

^22 

yes 

140 

II 

If 

II 

tf 

no 
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Table 13(cont'd) 


Male‘s 


Category 

Civilian 



1 Age 


number 

education 

Race 



Bonus 


141 

some HS 

not black 

4,5 

r“.— ..1 T . M. 

<18 

yes 

142 

ft 

II 

II 

If 

no 

143 

!1 

If 

11 

^18, <20 

yes 

144 

ft 

11 

II 

It 

no 

145 

If 

II 

If 

^20, <22 

yes 

146 

If 

II 

It 

ft 

no 

147 

f1 

If 

II 

^22 

yes 

148 

If 

If 

ft 

fl 

no 

149 

no HS 

black 

1,2,3A 

<18 

yes 

150 

rr 

II 

tf 

It 

no 

151 

If 

If 

II 

sl8, <20 

yes 

152 

tf 

ft 

II 

11 

no 

153 

If 

If 

If 

2:20, <22 

yes 

154 

If 

If 

If 

II 

no 

155 

If 

1? 

It 

^22 

yes 

156 

It 

II 

II 

II 

no 

157 

no HS 

black 

3B 

<18 

yes 

158 

If 

II 

II 

If 

no 

159 

II 

II 

II 

^18, <20 

yes 

160 

II 

If 

II 

fl 

no 

161 

II 

It 

TI 

s20, <22 

yes 

162 

tf 

II 

If 

II 

no 

163 

ft 

II 

If 

^22 

yes 

164 

If 

II 

II 

II 

no 
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Table 13(contM) 


Male^ 


Category 

Civilian 





number 

education 

Race 


Age 

Bonus 


165 

no HS 

black 

4,5 

<18 

yes 

166 

1! 

II 

ft 

If 

no 

167 

rf 

It 

If 

;sl8, <20 

yes 

168 

If 

11 

II 

II 

no 

169 

Tl 

II 

II 

^20, <22 

yes 

170 

M 

11 

II 

II 

no 

171 

ft 

It 

tl 

^22 

yes 

172 


It 

If 

II 

no 

173 

no HS 

not black 

1,2,3A 

<18 

yes 

174 

If 

IT 

It 

tl 

no 

175 

I? 

II 

II 

^18, <20 

yes 

176 

If 

If 

Tl 

It 

no 

177 

It 

II 

Tl 

^20, <22 

yes 

178 

If 

M 

II 

II 

no 

179 

II 

ft 

tl 

s22 

yes 

180 

If 

II 

If 

If 

no 

181 

no HS 

not black 

3B 

<18 

yes 

182 

ft 

11 

It 

ft 

no 

183 

It 

II 

It 

^18, <20 

yes 

184 

II 

If 

It 

II 

no 

185 

II 

If 

11 

^20, <22 

yes 

186 

II 

If 

It 

If 

no 

187 

IT 

If 

II 

s22 

yes 

188 

tl 

II 

II 

II 

no 
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Table 13 (cont'd) 


Male 


Category 

number 

Civilian 

education 

Race 

Mental 

group 

Age 

Bonus 

189 

no HS 

not black 

4,5 

<18 

yes 

190 


II 

II 

II 

no 

191 

ri 

II 

II 

kl8, <20 

yes 

192 

II 

II 

II 

11 

no 

193 

It 

II 

II 

^20, <22 

Yes 

194 

11 

II 

II 

II 

no 

195 

II 

II 

II 

^22 

yes 

196 

II 

11 

It 

II 

no 


^It is assumed that all females are true volunteers, either HS 
graduate or GED, mental group 1 or 2, and no bonus. 

^High school graduate. 

^General education development. 

^Voltmteers for draft and RA. 
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Table 14 


CATEGORIES FOR TRUE VOLUNTEER NFS GAINS FREQUENCY DISTRIBUTION 

(Population Partition 2) 


Category 

number 

Civilian 

education 

Race 

Mental 

group 

Term of 
enlistment 

197 

HSG 

black 

1,2,3A 

2 

198 

fl 

M 

3B 

It 

199 

II 

II 

4,5 

II 

200 

II 

not black 

1,2,3A 

II 

201 

II 

fl 

3B 

II 

202 

II 

11 

4,5 

II 

203 

GED 

black 

1,2,3A 

2 

204 

fl 

II 

3B 

II 

205 

II 

II 

4,5 

II 

206 

II 

not black 

1,2,3A 

II 

207 

11 

II 

3B 

If 

208 

11 

It 

4,5 

fl 

209 

not HSG or GED 

black 

1,2,3A 

2 

210 

II 

11 

3B 

If 

211 

II 

If 

4,5 

It 

212 

II 

not black 

1,2,3A 

II 

213 

II 

11 

3B 

II 

214 

If 

It 

4,5 

II 

215 

HSG 

black 

1,2,3A 

3 

216 

11 

II 

3B 

If 

217 

II 

11 

4,5 

It 

218 

II 

not black 

1,2,3A 

II 

219 

If 

M 

3b 

II 

220 

II 

II 

4,5 

II 
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Table 14 (cont’d) 


Category- 

number 

Civilian 

education 

Race 

Mental 

group 

Term of 
enlistment 

221 

GED 

black 

1,2,3A 

3 

222 

tl 

It 

3B 

If 

223 

tt 

1? 

4,5 

II 

224 

tt 

not black 

1|2,3A 

II 

225 

ir 

!l 

3B 

II 

126 

ff 

tl 

4,5 

II 

Til 

not HSG or GED 

black 

1,2,3A 

3 

228 

If 

It 

3B 

II 

229 

tl 

If 

4,5 

tl 

230 

If 

not black 

1,2,3A 

11 

231 

ft 

If 

3B 

II 

232 

tl 

If 

4,5 

If 

233 

HSG 

black 

1,2,3A 

4,5,6 

234 

If 

It 

3B 

II 

235 

It 

n 

4,5 

II 

236 

If 

not black 

1,2,3A 

II 

237 

If 

If 

3B 

II 

238 

It 

II 

4,5 

ft 

239 

GED 

black 

1,2,3A 

4,5,6 

240 

It 

ft 

3B 

It 

241 

It 

II 

4,5 

If 

242 

If 

not black 

1,2,3A 

ft 

243 

It 

II 

3B 

If 

244 

If 

II 

4,5 

11 
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Table 14(cont’d) 


Category 

number 

Civilian 
education | 

Race 

Mental 

group 

Term of 
enlls tmen t 

245 

not HSG or GED 

black 

1,2,3A 

4,5,6 

246 

f? 

ft 

3B 

If 

247 

II 

It 

4.5 

If 

248 

If 

not black 

1,2,3A 

fl 

249 

It 

If 

3B 

If 

250 

If 

If 

4,5 

ft 
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Regression output: 

1) TAPE15 (MTIO) Data structure information and forecasts 

for each regression run, to be used by the 
aggregation program. 

2) TAPE6 Printer file with complete regression results. 

3) TAPE30 (MT12) Date, data structure information, historical 

and forecast data for the graph program. 


Input Parameter Card Description 

Six card types are used for proper initiation of the preprocessor 
routine. Five are required and one is optional. 


Card type #1 Must precede all other input cards and contains base yr/mo data, 
title for the run, optional ’F' levels, and output controls. 

Card type #2 Contains information as to which table will be in use 

for run, and the name of one of the dependent variables. 

It also contains fields for the number of subproblems, 
the number of variables added by transgeneration and the 
number of transgeneration cards, if any. 

Card type #3 This card contains the series numbers that will be combined 
to form the dependent variable defined by card type #2. 

Card type //4 This card simply says 'END' and designates the end of 
a section. It shows the end of the dependent variable 
section, or the end of each group of independent variables 
selected for use in conjunction with each dependent variable. 

Card type #5 Contains the name of an independent variable, along with 
information about its effect on the regression equation, 
i.e., where the variable can enter the equation with 
positive or negative coefficients, and what lags the 
variable may have. 

Card tjqje #6 This card is optional and is only used for transgenera¬ 
tion of variables. It is used in conjimction with card 
type #2. 

Table 15 gives the layout for the required cards and Table 16 for the 

optional cards. Figure 11 shows the order of the input cards. 
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PREPROCESSOR INPUT PARAMETER CARD LAYOUT 
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OPTIONAL CARD USED IN CONJUNCTION WITH CARD TYPE 2 
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End of file 


This sequence of cards 
is repeated for each 
subproblem defined 
on cards of type 
2 



END 


Additional cards for additional 
independent variables 


-2 


-1 




MICIPA 


-2 


-1 


TRNGEN 


(optional) 


END 


Card type 4, denotes the end of the 
dependent variable section 


This sequence 
or cards is 
repeated for 
each depen¬ 
dent vari- i TABl 
able 



Cards of type 3, define dependent variable 


Title for card type 2 
card type 2 


BASE 


Card type 1, (first card, required only once) 


Fig. 11—^Preprocessor Input Parameter Sequencing 
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Detailed Description of Dependent Variable Card Type 2 

1. Field position 2 (dependent variable name) 

This eight character field contains the name which the user 
assigns to the dependent variable which he creates. This name is 
carried throughout the regression routine and all subsequent reference 
to any dependent variable is via this name 

2. Field position 3 (subproblem number) 

This number denotes the number of times the stepwise regression 
subroutine is initiated for that particular dependent variable. Referring 
to Fig. 11, it is necessary to insert complete sets of independent vari¬ 
able cards, back-to-back, equal to the number specified in this field. 

If this fact is overlooked, the correspondence between the dependent 
variables and the sets of independent variables will be out of sequence. 

3. Field position 4 (number of transgeneration cards) 

This field should be zero filled except where transgeneration 
cards are to appear in the regression. 

4. Field position 5 (number of variables added due to transgeneration) 
Some transgeneration codes do not create new variables, but simply 

alter existing independent variables, in which case this field should be 
zero. (Refer to Appendix B for details on transgeneration codes). 

Detailed Description of Series Card Type 3 

1. This card is broken into eight 10 character numeric fields. Each 
of the eight fields is again broken into two 5 character fields. 

2. To use one of these 10 character fields as an example, assume 
that for a particular dependent variable, the following series numbers 
are to be selected: 1 to 6, 12, 37, 55 to 60, 65, and 70 to 73. 

Rather than require to user to punch 1, 2, 3, 4, 5 and 6 into speci- 
field fields, the range may be specified, as shown here. 

I ' I _ 6j or j _ 5 5 6 0 i , etc. 

1 2 3 4 5 6 7 8 9 10 11 14 15 19 20 

To specify a single element of a table the user would simply place that 
series number in the second five digit field (right justified), i.e., 

12 

1 23456789 10 
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A blank in the fifth position of each 10 character field indicates 
that it is not a series of elements, but a single element. 

This example could be placed on a single series card. When a 
second, third or fourth card is required, it is imperative that the 
tenth position of each 10 character field be filled, otherwise the 
scanning terminates and the program assumes the end of the series. 

Detailed Description of Independent Variable Card, Type 5 

1. The first data field contains the name of an independent vari¬ 
able data file. This name must agree with a name in the table of names, 
listed on every run of the preprocessor. Refer to App 6 for the correct 
abbreviated spellings of these names. 

2. Field position 2 requires a 'P', 'N', or a blank. A 'P' or 'N' forces 
the sign of the coefficient of the entering variable such that it can only 
have a positive or negative effect on the regression result. If a variable 

is forced to have a positive coefficient and it would otherwise have a sig¬ 
nificant negative effect in the regression, the algorithm will exclude it 
from the regression. 

3. Field position 3 must contain the number of variables from the same 
family of variables that can be in the regression equation at any one time. 

Two variables belong to the same family if they were created from the same 
original independent variable by means of lagging. The original variable 
also belongs to the family. 

4. The fourth field position through the 16th is used in cases where 
an independent variable, such as a program variable, has a lagging effect. 

The first of the sixteen fields represents the maximum lag (or shift) that 
a variable may possess. For each position of these fields which is filled, 
a new variable will, essentially, be created. An illustration will help 

to clarify these data fields. 


Variable name Lag #1 Lag #2 
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In this example, three variables would be generated. The first would 
have a lag of 2 months, the second a lag of 1 month and the third would 
be the original variable, which is included automatically. During the 
course of the regression, only one of these variables could enter with 
either a positive or negative coefficient. 

Note: In order to properly align the data bases, the preprocessor 
searches all independent variable data cards and stores the maximum shift 
of all variables, and adjusts the starting point of the data bases. A 
maximum shift of two months would shorten the time span of a data base 
by two months. 


generated 

variables 




^2 

^3 


^5 

^6 

^7 

00 

> 

Month 1 








10 

2 








20 

3 








30 


Vi 

V L* 

8 2 

no 

no 

data 

data 

no 

no 

data 

data 

20 

10 


1 

2 

3 new starting point 


Therefore, labels generated would appear: IV , 2V , 3V^...8 VqL„, 9VoL-, 

i JL J o 2 o 1 

Sample input parameter listing 1: 

This example shows the relationship of each dependent to its indepen- 
^dent variable string. 


TAPI 

71 01 

FE^»ALES 

»»TrpT 

1 

0 0 

00 

1 

TAPI 

k 

P-HS-1-3 

FEMftLES 

1 

00 

00 

5 

TAPI 

20 

B-HS-4-5 

MALF^BLACK,HSG,CAT 

1 

1,2,3 

0 0 

00 

21 


MALE,BLACK,HSG,CAT 

4,5 



Where V^L- and stand for variable Vq lagged by zero, one 

o U o 1 o 2 o 

and two ninths, respectively. 
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TAPI 

?9 

TAPI 

45 

TAPI 

53 

TAPI 


W-HS-1-3 1 no 

MAL5,WHT'^P, HSGtCAT 1,2,3 
44 

W-HS-4-5 1 00 

male,WHIT^,HSG,CAT 4,5 
5? 

1 0 0 

MALE,NHSr,,PLACK,CAT 1,2,3 
6^ 101 116 li.q 164 

M-MHS-1-3 1 CO 

MALE,NHSG,WHITE,CAT 1,2,3 


77 92 

125 

140 

173 

13P 

END 





MICIPA 

P 

1 

-2 

-1 

PACUPG 

D 

1 



PEC ASS 

P 

1 

-2 

-1 

PECR 

P 

1 

-2 

-1 

time 


1 



PRTMPH 

P 

1 


-1 

PNSHSG 


1 



PMSKLl 

P 

1 



PNSKL2 

P 

1 



TYPPT 

P 

1 



ENP 





MICIPA 

P 

1 

-2 

-1 

PASURG 

P 

1 



RECASS 

P 

1 

-2 

-1 

OUMEMP 

P 

1 

-2 

-1 

oRTMpo 

P 

1 

-2 


PNSHSG 

p 

1 



PNSKLl 

P 

1 



PNSINC 

P 

1 



PAIPTV 

p 

1 

-2 

-1 

UOCCAN 

P 

1 

-2 

-1 

CAOPTS 

P 

1 

-2 

-1 

TYOPT 

P 

1 



END 





MICIPA 

P 

1 

-2 

-1 

PASURG 

P 

1 



RECASS 

P 

1 

-2 

-1 

PECR 

Q 

1 

-2 

-1 

OUNEMP 

P 

1 

-2 

-1 

PRIMED 

D 

1 

-2 

-1 

PAIOTV 

P 

1 

-2 

-1 

UOCCAN 

P 

1 

-2 

-1 

CAOPTS 

P 

1 

-2 

-1 

TYOPT 

D 

1 



PNSNHS 

P 

1 



CAT4LM 

P 

1 



END 
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MIC 7PA 

P 

1 

-2 

-1 

PASUPG 

P 

1 



RECASS 

o 

1 

-2 

-1 

ounfmp 

p 

1 

-E 


P PTMt^O 

p 

1 

-2 

-1 

9NSHSG 

p 

1 



ENSKLl 

p 

1 



BNSINC 

p 

1 



PAIDTV 

p 

1 

-2 

-1 

UDCCAN 

p 

1 

-2 


CAOPTS 

p 

1 

-2 

-1 

TYOPT 

p 

1 



EMH 





HICIPA 

p 

1 

-2 

-1 

PAEUPG 

p 

1 



PECASS 

p 

1 

-2 

-1 

RECP 

p 

1 

-2 

-1 

■ DUNFMp 

p 

1 

-2 

-1 

PPTMFO 

p 

1 

-2 

-1 

PATPTV 

p 

1 

-2 

-1 

UOCCAN 

p 

1 

-2 

-1 

CAOPTS 

p 

1 

-2 

-1 

TYOPT 

o 

1 



RNSNHS 

p 

i 



CAT4LM 

o 

1 



ENO 





MTCIPA 

p 

1 

-2 

-1 

OASUPG 

o 

1 



PECASS 

p 

1 

-2 

•1 

PECR 

o 

1 

-2 


nUNFMP 

p 

1 

-2 

-1 

PRTMED 

p 

1 

-2 

-1 

RNSNHS 

p 

1 



HSFL74 

N 

1 



UOCCAN 

P 

1 

-2 

-1 

OPTSTO 

P 

1 

-2 

-1 

aplcy 

N 

1 

* 


TYOPTN 

P 

1 



END 





TilCIPA 

P 

1 

-2 

-1 

PAEUPG 

P 

1 



RECASS 

P 

1 

-2 

-1 

RECR 

P 

1 

-2 

-1 

OUNFMp 

P 

1 

-2 

-1 

PRICED 

P 

1 

-2 

-1 

BNSNHS 

P 

1 



HSFL74 

N 

1 



UOCCAN 

P 

1 

-2 

-1 

OPTSTO 

P 

1 

-2 

-1 


flPLCY N 1 

TYOPTN P 1 

END 
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Sample input parameter listing 2; 

This example demonstrates the placement of transgeneration cards 
in a deck set-up. They always precede independent variables for the 
corresponding dependent variable. 


dAS£ 11 

rA32 6L.<-17-H5 i 


TITLE 

197 216 214 2 35 ..... ... 253 

£N0 

TRNGcN G2/ .. 33 j b3.. „ . ___ 

TRNG£Ni]i3 09:: 62. 

RECR P 1 

RcCASS o 1 -2 *1 

UOCCAM . ^ 1 -2 -1 

TYOPTN p 1 

CAOPTS P 1 -2 -1 

EMO ....... '. .. 


C2 0 1 




24 2 .2 5j 


In this simple example* there is only one dependent variable. There 
are five basic independent variables with six more generated through lags. 

The transgeneration cards will also create an extra variable. The 
dependent variable card specifies that there are two transgeneration cards, 
and that one new variable will be added, due to transgeneration. 

The formats of the TRNGEN cards are defined in App B. The indices 
referred to correspond to those assigned by BMD02R as follows with the lags 
indicated in parentheses: 


Index 

Variable 

1 

dependent variable 

2 

RECR 

3 

RECASS(-2) 

4 

RECASS(-l) 

5 

RECASS(0) 

6 

U0CCAN(-2) 

7 

UOCCAN(-l) 

8 

UOCCAN(O) 

9 

TYOPTN(O) 

10 

CAOPTS(-2) 

11 

CAOPTS(-2) 

12 

CAOPTS (0) 
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In the first TRNGEN card, 08 denotes the additive operation, 07 
the variable to be replaced, 06 the variable operated on, and 3.0 
a constant, Syinbolically, this is expressed by: 

X, + 3.0 - 

o 7 

where X^ and X^ refer to U0CCAN(-2) and UOCCAN(-l), respectively. In 
the second TRNGEN card, 09 denotes the multiplicative operation. Symboli¬ 
cally, this is expressed by 

2.0 -- Xj3 

where X- is as before and X,_ is a new variable. 

6 13 

POST PROCESSOR—GRAPH PROGRAM (PROGRAM #4) 

This routine is dependent upon the successful completion of the pre¬ 
processor program, to ensure creation of a file containing input data to 
this routine. In addition to this file, a minimum of 3 input parameter 
cards are required. 

The output of the program is a plot for each dependent variable 
selected for use in the preprocessor, as well as a deviations table list¬ 
ing the dependent variable values, regression equations estimates (Y), 
the residual error based on the two preceding quantities, and the percent 
error. 

All plots are assigned to an output file, as are the deviations tables, 
but the tables are only generated if requested in the parameter cards. A 
schematic of the post processor including both the graph (program #4) and 
the aggregating (program #5) programs is given in Fig. 12. 

Input Files 

TAPEl (MT12) . This file contains binary data written from the regression 
routine. It consists of data length, starting and ending dates, time-series 
data for the dependent variable, and projections. 

TAPES . This is the parameter card file and contains control information, 
such as today's data, whether graphs are to be printed, tape number of input 
data, name given to the run, and end of data identifier. 
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Output Files 

TAPEIO (MT14) . Contains the graphs- This allows the graphs to be 
retained for later printing, or if graphs are not desired, to be ignored. 

TAPE6 . This is the standard printer file and will contain titles, 
input parameter specifications, and deviations tables. 


Input Card Formats and Specifications 


Card # 

CC# 

Format 

Program 

name 

Description 

1 

1-12 

3A4 

DATE 

Today^s data (in any form). 

2 

1-2 

12 

IG 

Place'99^ if only graphs are 
desired, otherwise leave blank. 


6-10 

15 

I TAPE 

Tape # of input data, if other 
than TAPEl. 

3 

1-80 

20A4 

lASSUM 

Any alphanumeric description of 
this run. User can use as many 
cards here as he wishes. 

n 

1-4 

14 

IH9 

Must contain 9999 to indicate end 
of input data, where n is the 
number of data cards. 


The following is a listing of sample input data. 


MAP 2 1076 

1 

TEST OP OLOT ROUTINE 

9999 

POST PROCESSOR—AGGREGATION PROGRAM (PROGRAM #5) 

This routine, which is the last in the series of the NPSGM, aggregates 
the dependent variables selected for use in the preprocerror program into 
a maximum of four C-groups. Generated within this program are aggregated 
seasonal coefficients and consolidated historical and projection data for 
each of the C-groups defined by the user. 
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The outputs of the program consists of (a) one file containing 12- 
month forecasts (expressed in thousands) beginning on the first month 
after the end of the data base and (b) a punch file, used as input to 
the C0MPLIP-G2 model. The punch file contains the seasonal coefficients 
associated with each month of the forecast period for each C-group. 

Inputs to the program consists of a file containing projected time- 
series data ,and input parameter cards. The input file contains projections, 
organized in a similar manner as the dependent variable file in that there 
are as many 12-month groups of projections as there are dependent variable 
series on the first file. The card input consists of four types of cards. 
They are: (1) the title card, (2) user supplied estimates of accessions 
for the 12 future months of population groujP that are not included in the 
regression results, but are desired to be included in the aggregations 
for the C-groups to be used in C0MPLIP-G2, (3) control cards indicating 
the aggregations to be made, and (4) the END terminator card. 

Of a maximum of 20 possible dependent variables coming from the pre¬ 
processor, a total of four new C-groups may be created through aggregation. 
The program assumes that all variables coming in will be used in one of the 
C-groups, although it is not a fatal error if one is omitted. The use of 
any dependent variable in more than one C-group will result in a fatal 
diagnostic and a terminated run. 

This aggregation is done by listing the series, coming from input, 
that are to be aggregated into each of the new C-groups, therefore, a 
knowledge of the preprocessor output is essential for meaningful aggregation 

If the population groups defined by the dependent variables from the 
regression run do not span the entire population, then the user may input 
additional groups (similar to those from the regression program) by means 
of card input. The index numbers assigned to the user-supplied groups will 
follow those used for the dependent variables, and the total must not exceed 
20. For example, if seven dependent variables were used in the regression 
run (each defining a imique population group), then the first user-supplied 
group would have the index eight assigned to it. The indices are used in 
defining the aggregations. 

The card formats are as follows: 
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Title Card, Card Type 1 - FOR^IAT (15A4) 

This input consists of one card with the title to be printed on 
the reports. 

User-Supplied Population Groups, Card Type 2 - FORMAT (A3, F8.2 ) 

This input consists of 12 cards for each user supplied population 
group, one card for each of the projection months defined by the regression 
program. Within each group the cards must follow the same sequence projected 
by the regression program. That is, if the first projection month is July, 
then the cards are ordered as follows: July, August, September,May, 

June, for each group, one group following another group in succession. The 
last card of the last user-supplied group is followed by an END card (card 
type 4). The END card is read with the A3 format. 

C-Group Definition, Card Type 3 - FORMAT (A3, 7X, 7 (13, 7X) 

The use of as many cards as are necessary to list all of the dependent 
variables as user-supplied population groups that comprise each C-group is 
permitted until a maximum of 20 is reached or an END card is encountered. 

Each set of cards defining a C-group is followed by an END card. The END 
card is read with the A3 format. 

END, Card Type 4 

This is a delimiter card. The word END is punched in the first three card 
columns. The card is read with the format statements for card types 2 and 3. 
The following is a listing of a sample input data. 

TFST OF aGOFFGaTION ROUTINE UETMG 7 OEPENOENT VARIABLES 

Input for user defined group, 
group 8 


4 

5 6 7 


100 

200 

300 

400 

500 

600 

700 

800 

900 

1000 

1100 

1200 


END 

END 
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END 
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0.0 0 
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0.00 
0.00 
0.00 

1 

2 

3 

8 
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2 4 7 8 


12 car 
each u 
define 
group 


3 5 6 


Title card 



Fig. 13—Example of Input Cards for Aggregation Program 

In this case, 7 dependent variables came from the input file and two 
population groups were input by cards. 








Input Files 

TAPE2 (MTIO) > This is a BCD file containing the forecasts of the 
dependent variables that are to be aggregated. 

TAPES f This is the parameter card file containing control informa¬ 
tion supplied by the user. 

Output Files 

TAPEIO (MT13) . This, file contains the aggregated C-groups in the 
format required by the C0MPLIP-G2 matrix generator as input file TAPE17. 

TAPE7 . This file should be assigned to the system punch. It 
contains the seasonal coefficients for use by the C0MPLIP-G2 matrix 
generator. 
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Appendix A 


LISTING OF INDEPENDENT VARIABLE TIME SERIES 
(For definitions see Table 7 of Chap. 2) 


Note: Each time series is given in the following 
four columns: 

1. variable identifier 

2. month 

3. calendar year 

4. monthly value 
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SOURCES OF INDEPENDENT VARIABLES 


The sources are cited for those independent variables where it is not 
obvious (e.g., most policy variables reflect Department of the Army policies 
no source is cited for them). 

1. USAREC Req 680-1 is the source for the timing of the different options 
and the total options available to a recruit in any given month. This is 
the source for the following variables: 

CAOPTS, OPTSTO, TYOPT, TYOPTN 

2. N.W. Ayer Co., the Army advertising agency is the source for the 
print media variable: PRTMED. 

3. "Effectiveness of the Modem Volunteer Army Advertising Program," 
prepared by Stanford Research Institute for OSAMVA, December 1971, is 
the source for the paid TV variable: PAIDTV. 

4. "Employment and Earnings," Bureau of Labor Statistics, February 1973 

is the source of the seasonal adjustment factors and the unemployment rates. 
Tfie seasonal adjustment factors for 16-19 year old males were applied to 
the unemployment data on 16-21 year old males from Tables A-5 and A-7 
of th.e same monthly publication to derive the monthly estimates of season¬ 
ally adjusted xmemployment rates: DUNEMP. 
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TYOPT 

1 

73 

0.00 

TYOPT 

2 

73 

0.00 

TYOPT 

3 

73 

0 . 30 

TYOPT 

4 

73 

0. 00 

TYOPT 

5 

73 

0.00 

TYOPT 

6 

73 

0.00 

TYOPT 

7 

73 

0.00 

TYOPT 

a 

73 

0. 30 

TYOPT 

Q 

73 

0,00 

TYOPT 

10 

73 

. 20 

TYOPT 

11 

73 

.50 

TYOPT 

12 

73 ' 

.50 

TYOPT 

1 

74 

1.00 

TYOPT 

2 

74 

1. 00 

TYOPT 

7 

74 

1. 00 

TYOPT 

4 

74 

1.00 

TYOPT 

E 

74 

1.30 

TYOPT 

6 

74 

1.30 

TYOPT 

7 

74 

1.00 

TYOPT 

8 

74 

1.00 

TVOPT 

9 

74 

1.00 

TYOPT 

10 

74 

1.00 

TYOPT 

11 

74 

1.00 

TYOPT 

12 

74 

1.00 

TYOPT 

1 

7E 

. 80 

TYOPT 

2 

75 

.50 

TYOPT 

3 

75 

.50 

TYOPT 

4 

75 

.50 

TYOPT 

5 

75 

.50 

TYOPT 

6 

75 

.25 

TYOPT 

END 

7 

75 

0.00 
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TYOFTN 

1 

71 

c. oc 

TYOPTM 

2 

71 

n. DC 

TYOPTN 

•? 

71 

0.00 

TYCPTN 

4 

71 

0. GO 

TYPPTN 

c 

71 

0. GO 

TVOPTN 

6 

71 

0.00 

TYCPTN 

7 

71 

0. GO 

TYCPTN 

8 

71 

0.00 

TYCPTN 

q 

71 

0.00 

TYCPTN 

10 

71 

O.GO 

TYCPTN 

11 

71 

0.00 

TYCPTN 

12 

71 

0.00 

TYOPTN 

1 

72 

0 . 00 

TYOPTM 

2 

72 

0.00 

TYOPTN 

3 

72 

0.00 

TYCPTN 

4 

72 

0.00 

TYOPTN 

5 

7? 

0.00 

TYOPTN 

6 

72 

0.00 

TYCPTN 

7, 

72 

0.00 

TYCPTN 

8 

72 

0.00 

TYCPTN 

q 

72 

0.00 

TYOPTN 

10 

72 

0 . u 0 

TYCPTN 

11 

72 

0.00 

TYCPTN 

12 

72 

0.00 

TYOPTN 

1 

73 

0.00 

TYOPTN 

2 

73 

o.oc 

TYCPTN 

3 

73 

0.00 

TYCPTN 

4 

73 

0 . 00 

TYOPTN 

E 

73 

0 . 00 

TVOPTN 

6 

73 

0.00 

TYOPTN 

7 

73 

0.00 

TYCPTN 

8 

73 

0 . 00 

TYCPTN 

q 

73 

0.00 

TYCPTN 

10 

73 

.20 

TYOPTN 

11 

73 

1.00 

TYOPTN 

12 

73 

1.00 

TYCPTN 

1 

74 

1.00 

TYCPTN 

2 

74 

1.00 

TYOPTN 

7 

74 

l.OG 

TYOPTN 

4 

74 

1.00 

TYOPTN 

5 

74 

1.00 

TYO PTN 

6 

74 

1.00 

TYCPTN 

7 

74 

1.00 

TYCPTN 

8 

74 

1.00 

TYOPTM 

q 

74 

1.00 

TYOPTN 

10 

74 

1.00 

TYCPTN 

11 

74 

1.00 

TYOPTN 

12 

74 

1.00 

TYOPTM 

1 

75 

. 80 

TYOPTN 

2 

75 

.50 

TYCPTN 

7 

75 

.50 

TYCPTN 

4 

75 

.50 

TYOPTN 

5 

75 

.50 

TYOPTN 

6 

75 

. 25 

TYOPTN 

END 

7 

75 

0 . 00 
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U OCCAM 

tj OCCAM 

UOCCAN 

U OCCAM 

IJOCCAM 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

UOCCAN 

U or CAM 

UOCCAN 

END 


1 

71 

2 

71 


71 

4 

71 

5 

71 

6 

71 

7 

71 

8 

71 

9 

71 

10 

71 

11 

71 

12 

71 

1 

72 

2 

72 

3 

72 

4 

72 

5 

72 

6 

72 

7 

72 

« 

72 

9 

72 

10 

72 

11 

72 

12 

72 

1 

73 

2 

73 

3 

73 

4 

73 

5 

73 

6 

73 

7 

73 

8 

73 

9 

73 

10 

73 

11 

73 

12 

73 

1 

74 

2 

74 

3 

74 

4 

74 

5 

74 

6 

74 

7 

74 

8 

74 

9 

74 

10 

74 

11 

74 

12 

74 

1 

75 

2 

75 

3 

75 

4 

75 

5 

75 

6 

75 

7 

75 

8 

75 


0.00 

30.00 

33.00 

33. OO 

60.35 
46.75 
51.00 

73.35 
112.50 
133.35 
139.75 
129.00 
242.07 

331.25 
434.00 
649.30 

827.25 
933.3(3 

1192.25 
929.90 
717.00 
608.00 
513.00 
505.00 
483.00 
494.00 
485.00 
557.00 
575.00 
509.00 
686.00 
650.00 
540.00 
861.00 
1000.00 
1240.00 
1532.00 
1510.00 
1664,00 
j 634.30 
1582.00 
1307.OC 
1206.00 
675.00 
563.00 
559.00 
346.00 
250.00 
250.00 
72.00 
243.00 
285.00 
273.00 
36.00 
86.00 
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Appendix B 

BMD02R TRANSGENERATION CARDS 

(Reproduced from Reference 5, 
pages 15-21) 
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B. Transgeneration Cards 

The term transgeneration is used to include transformations of 
input variables and creation of new variables prior to the normal 
computations performed by the various programs. 

The transformations described below are performed on the values 
of the variables in each case. In these examples, the symbol x. 
will denote the i^^ variable as well as its value. 

Example s: 





^^4 

X ' 


X replaces X 

5 i 


^2 "'S- 


X^ + replaces 

^2 


By successive transformations, more complicated relationships 
may be obtained. For example: 
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^ O a. 


Section III 


Y Z 2 

four transformations are 

required: 


Variables as they are stored at 
each step 


Transformation 



''3 

^4 


• 




^3 

^4 


- 3 ^ - 



^2 

X 2 

^4 


XjfX3 - 



^2 

X ^ 
3 

^4 

Xi^+ Xj^ 

VXg 



''2 

X 2 

X 3 

^4 

t/Xi^X3" 


In this example, it can be seen that the original values of Xg 
are irrelevant. Actually the variable Xg may be a dummy 
variable introduced by the program specifically to provide 
capacity for creating new variables by trans gene ration. 
Dummy variables may be required for intermediate storage 
in order to effect some transformations. 

2 

(ii) To replace X^^ by exp (-1/2 X^ ) three transformations are 
required: 


Transformation 


''2 

^3 

^ 1 " - 

X^ 

^1" 

''2 

^3 

-1/2 Xj- 

-> Xj 

-1/2 Xj^ 

^2 

^3 

exp (Xj)- 


exp (-1/2 Xj^) 

^2 

^3 


(iii) To replace X^ by X 2 +log^(X^-X^+100) four transformations 
are required: 


Transformation 


^2 

^3 

^4 

V V —. 


X 

X 

X 

X 4 -X 3 

^4 ^3 


“^1 

^2 

^3 

v* X 1 nn ^ 

■ --V V 

X 

X 

X 

(X^-Xj+lOO) 

A . T 1UU 

4 

X 4 

^1 

2 1 

^3 
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Transformation 


^2 

^3 

^4 

*°gl0^4" 



^2 


>°8io(^4-''3+‘0'» 

X 2 +X 4 - 

->^4 


^2 

^3 

X2+logio(X4-X3+ 


The transformations are performed in the order in which the 
Transgeneration Cards appear, so that, for example, the two 
transgenerations ^ followed by ^ ^ ^1 result 

in 2 Xj- 2 , whereas X^- 2 —►X^ followed by 2X ^—► X^ will 


result in 2(Xj-2). 


TRANSGENERATION LIST 

Notation to be used in the following transgeneration list: 

i, j, k are variable indices (need not be different) 
c is a constant 

^ 2 ’ ^ 3 ’ 

n is the number of cases, or sample size 

- 1 ^ 

The mean X. = 


Code 


:.=i- 2 X.. 

[ 1 n _ 2 

n^ 2 ^^ji~ 

j = l 

Trans gene ration 


1/2 


Restriction 


01 

Vx"-x^ 

X, > 0 

1 ““ 

02 

V5r 

X. > 0 

1 ~ 

03 


X. > 0 

1 

04 

e ^ —►X, 
k 

- 

05 

arcsin "^X^—► X^^. 

0 < X. < 
«. 1 — 
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Code 

T ransgeneration 

Restriction 

06 

arcs in "\/X./(n+l) + arc sin *\y^X. + l)/(n+l) — 

1 1 xC 

0 < (X./n) < 1 
1 

07 

l/X.—>x, 

1 k 

X. 4= 0 

X 

08 

X.+ c->X 

1 k 

- 

09 

X.c —>X, 

1 k 

- 

10 

X.*^—*«X, 

1 k 

X. > 0 

X 

11 

x.+x.-^x, 

1 j k 

- 

12 

x.-x.— 

1 J k 

- 

13 

X.X.—^x, 

1 j k 

- 

14 

X./X.-^X, 

1 J k 

X. 4= 0 

J 

15 

If X. > c, 1->X ; 

- 


otherwise 0—^X, 
k 


16 

If X. ^ X., 1—>X, ; 

1 J k 

- 


otherwise 0 —>X, 

k 


17 

log,X.->X^ 

X. > 0 

X 

18 

x.-x.—>X, 

Ilk 

- 

19 

X./s.—>X. 

Ilk 

- 

20 

sin X.—>X, 

1 k 

- ■ 
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Code 

T ransgeneration 

.Restriction 

21 

cos X. ->X, 

1 k 

- 

22 

arctan X.~> X, 

1 k 

- 

23 

X.^j->X, 

1 k 

X.> 0 

1 

24 

c ^ ->X 

k 

c > 0 

25 

x.-^ X, 

1 k 

- 

26 

c^X, 

k 

(Leave code i blank) 

27-39 

Not defined 



40 


41 


If X. = a, or a_ or , a_, then c->X ; 

1 1 2 3 7 k 

otherwise X remains unchanged. 

xC 


If X. is blank, then c->X, ; 

1 k 

otherwise X remains unchanged. 

^i'Note that in reading numeric fields, a blank 
field and -0 are equivalent. 


(X. ^ -0)* 


42 If X. = a, or a., or a_ . . . , a^, then X.^X, ; 

1 1 2 3 7 j k 

otherwise X remains unchanged. 

43 If X. is blank, then X.^X ; (X. 4 -0) 

1 j k 1 

otherwise X remains unchanged. 

When a violation of a restriction in the right-hand column occurs during 
transgeneration, the program will print a diagnostic message. Most 
programs will proceed to the next problem, if any. Some programs will 
delete the case where the violation occurred and continue the computation. 
Other programs will screen all the input data for additional restriction 
violations before proceeding to the next problem, if any. 


1. Standard Transgeneration Cards 

Standard Transgeneration Cards are used with programs which 
use Standard Data Input (see Section II-C). Let p denote the 
number of variables in the data matrix and m the maximum 
number of variables allowed by the program for any problem. 
Any of the variables x^, . . . , x^^ maybe used in transgeneration. 
The initial values of the first p variables are read from the 
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input data file (Data Cards or Alternate Input Tape). The initial 
values of the remaining m-p variables are left over from previous 
calculations. After transgeneration of a particular case, the 
values of the first p+q variables for that case are used as the 
values of the transgenerated variables. If the p+q variables 
required for the computation are not the first p+q, they must be 
relocated. This may be done by using transgeneration code 
number 25. The numbers p and q (q may be positive, negative, or 
zero) are specified on the Problem Card. The indices i, j, and k 
from the transgeneration list may exceed p or p+q but must never 
exceed m. 

Card Preparation 

Col. 1-6 TRNGEN (Mandatory) 

Col. 7-9 Variable index k 

Col. 10,11 Code from transgeneration list (restricted by 

availability in particular program) 

Col. 12-14 Variable index i 

Col. 15-20 Variable index j or constant c 

Col. 21-25 Blank 

Col. 26 Number of a^'s for transformation 40 or 42 

Col. 27-32 a^^ value 

Col. 33-38 value 


Col. 63-68 value 

The constants c, a , . .. , a are punched with a decimal point if 
used with variables which have an F-type format and without a decimal 
point if used with variables which have an I-type format (see 
Section III-C). 

The Standard Transgeneration Cards for the three examples on pages 
16 and 17 are: 

(i) TRNGENOOI 100012. 0000 

TRNGEN003100032. 0000 
TRNGEN00511001000003 
TRNGEN00501005000000 

Revised: 9-1-65 
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(ii) TRNGENOOl 100012. 0000 
TRNGEN00109001-0. 500 
TRNGENOOl 04001000000 

(iii) TRNGEN00412004000003 
TRNGEN00408004100. 00 
TRNGEN00403004000000 
TRNGEN00411004000002 

2. Special Transgeneration Card 

This card is used only in those programs which require 
transformations of the form f{Y) ^Y. One "Special" Trans- 
generation Card will specify successive transformations. 

Card Preparation 


Col, 1-6 
Col. 7 
Col. 8, 9 
Col. 10-15 

Col. 16, 17 
Col. 18-23 


SPECTG (Mandatory) 

Number of transformations {< 8 for each card) 

Code* for the 1st trans gene ration 

Constant** for the 1st transgeneration (if none, 
leave blank) 

Code* for the 2nd transgeneration 
Constant** for the 2nd transgeneration 


Col. 64, 65 Code* for the 8th transgeneration 
Col. 66-71 Constant** for the 8th tra'nsgeneration 
As an example, if the user desires the transformation 
( 3 + 100)^—>Y 

he could accomplish this with the Special Transgeneration Card 
SPECTG5102. 000008-3. 0000100000008100. 00102. 0000 

* Code must be one of the set of permissible codes specified in 
the individual program description. 

** Keypunch decimal point. The decimal point is not required for 
right-justified integers, 
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